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(57) Abstract 

A method of increasing the production of lovastatin or monacolin J in a lovastatin-producing or non-lovastatin-producing organism 
is disclosed. In one embodiment, the method comprises the steps of transforming an organism with the A. terreus D4B segment, wherein 
the segment is translated and where an increase in lovastatin production occurs. 
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METHOD OF PRODUCING ANT I HYPERCHOLESTEROLEMIA AGENTS 
CROSS-REFERENCES TO RELATED APPLICATION 
Not applicable. 

STATEMENT REGARDING FEDERALLY SPONSORED 
RESEARCH AND DEVELOPMENT 

This invention was made with United States 
government support awarded by the following agencies: NIH 
Grant No: AI43031. The United States has certain rights 
in this invention. 



Cholesterol and other lipids are transported in body 
fluids by low-density lipoproteins (LDL) and high-density 
lipoproteins (HDL) . Substances that effectuate 
mechanisms for lowering LDL-cholesterol may serve as 
effective ant ihypercholesterolemic agents because LDL 
levels are positively correlated with the risk of 
coronary artery disease. 

MEVACOR (lovastatin; mevinolin) and ZOCOR 
(simvastatin) are members of a group of active 
antihypercholesterolemic agents that function by 
inhibiting the rate- 1 imi ting step in cellular cholesterol 
biosynthesis, namely the conversion of 

hydroxymethylglutarylcoenzyme A (HMG-CoA) into mevalonate 
by HMG-CoA reductase. 

The general biosynthetic pathway of a naturally 
occurring HMG-CoA reductase inhibitor has been outlined 
by Moore, et al., who showed that the biosynthesis of 
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mevinolin (lovastatin) by Aspergillus cerreus ATCC 20542 
begins with acetate and proceeds via a polyketide pathway 
(R.N. Moore, et aJL . , Amer . Chem, Soc . 107:3694-3701, 

1985) . Endo, e_t aJL . described similar biosynthetic 
5 pathways in Pencillium citrinum NRRL 8082 and Monascus 

ruber M-4681 (A.Y. Endo, ejt al . , J_^. Ant ibiot . 38:444-448, 
1985) . 

The recent commercial introduction of microbial HMG- 
CoA reductase inhibitors has fostered a need for high 
10 yielding production processes. Methods of improving 
process yield have included scaling up the process, 
improving the culture medium and simplifying the 
isolat ion . 

Previous attempts to increase the biosynthesis of 
15 HMG-CoA reductase inhibitors at the level of gene 

expression have focused on increasing the concentration 
triol polyketide synthase (TPKS) , a multifunctional 
protein with at least six activities as evidenced by the 
product of the enzymatic activity (Moore, supra , 1985) . 
20 TPKS is believed to be the rate- limiting enzymatic 
activity (ies) in the biosynthesis of the HMG-CoA 
reductase inhibitor compounds. 

U.S. patent 5,744,350 identifies a DNA encoding 
triol polyketide synthase (TPKS) from Aspergillus 
2 5 terireus. "NPKS" is now preferred to TPKS as the acronym 
for "nonaketide polyketide synthase." 
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In one embodiment, the present invention is a method 
of increasing the production of lovastatin in a 
lovastat in-producing organism. The method comprises the 
steps of transforming the organism with a nucleic acid 
sequence comprising the D4B segment, preferably 
comprising nucleotides 579 - 33,000 of SEQ ID NO: 18 and 1 
- 5,349 of SEQ ID NO: 19.. The nucleic acid sequence is 
transcribed and translated and an increase in lovastatin 
production occurs. Preferably, this increase is at least 
2-fold. 

In a preferred form of the present invention, the 
lovastat in-producing organism is selected from the group 
consisting A . Cerreus ATCC 20542 and ATCC 20541. 

In another embodiment, the method comprises the step 
of transforming the organism with the corresponding D4B 
segment isolated from a non-A. terreus lovastatin- 
producing organism . 

In another embodiment, the present invention is a 
method of increasing the production of lovastatin in a 
lovastat in-producing organism, comprising the step of 
transforming the organism with the LovE gene, wherein the 
nucleic acid sequence is transcribed and translated and 
wherein an increase in lovastatin production occurs. 

In another embodiment of the present invention, one 
may increase the production of monacolin J in a non- 
lovastatin-producing organism comprising the steps of 
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transforming the organism with a nucleic acid seauence 
comprising the D4B segment. As a further step, one may 
additionally transform the organism with an entire LovF 
gene. If the entire LovF gene is added to the D4B 
5 segment, the organism will produce lovastatin. 

In another embodiment, the present invention is the 
lovastatin production gene cluster, preferably SEQ ID 
NOs:18 and 19, and the individual genes comprising that 
cluster . 

10 It is an object of the present invention to provide 

a method for increasing lovastatin and monacolin J 
production in both lovastat in-producing and non- 
lovastatin producing organisms. 

Other objects, features and advantages of the 

15 present invention will become apparent after review of 
the specification, claims and drawings. 

DESCRIPTION OF DRAWINGS 

Fig. 1 is a diagram of lovastatin production genes. 
Fig. 2 is a schematic diagram of a hypothetical 
20 mevinolin/lovastatin biosynthesis pathway. 

Fig. 3 is a comparative diagram of statins. 
Fig. 4 is a schematic drawing of plasmid 
pWHM1264/CB24A. 

Fig. 5 is a schematic drawing of plasmid pWHM1424 . 
25 Fig. 6 is a schematic drawing of plasmid 

CD4B/pWHM1263 . 
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In General 

The Examples below disclose the cloning and 
sequencing of a cluster of 17 genes from A. terreus ATCC 
20542, a strain that natively produces lovastatin (See - 
Fig. 1) . These genes flank the NPKS gene, which is known 
to be required for lovastatin production (see, for 
example, U.S. patent 5,744,350). 

The DNA sequence of the cluster has been determined 
and is disclosed below at SEQ ID NOs:18 and 19. 
Mutations in four of the genes (P450I/LovA, SEQ ID NO:22; 
dehydrogenase/LovC, SEQ ID NO: 24; esterase/Lov£>, SEQ ID : 
NO: 25/ and ScPKS/LovF, SEQ ID NO: 2 9) have been isolated 
and demonstrate that each of these four individual genes 
is required for lovastatin production. These genes are 
indicated with an X symbol in Fig. 1 and referred to 
herein as the "A. terreus lovastatin gene cluster." 

Another of the genes (Zn Finger I/LovK, SEQ ID 
NO: 27) is thought to regulate the transcription of the 
other genes and causes a notable increase in lovastatin 
production when reintroduced into A . terreus ATCC 20542. 

Applicants have used the following convention in 
naming the genes and proteins of the present invention. 
The genes and proteins are first named with either an 
"ORF" or u Lov" prefix and then named either numerically 
or alphabetically. u Lov" signifies genes shown to be 
essential for lovastatin production. Applicants have 
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also included a descriptor name that describes a orobabie 
function of the protein. For example, SEQ ID NO : 1 is 
described as the "ORFl/esterase-like protein" because 
Applicants have compared the amino acid sequence to known 
5 esterases . 

The portion of the gene cluster between 
ORFl/esterase-like protein and the mid-region of 
LovF/SCPKS is referred to as the "D4B segment". The A, 
terreus D4B segment is contained within a plasmid clone 

10 deposited as ATCC 98876. As described below, other 

lovastat in-producing organisms contain an analogous D4B 
segment comprising analogous genes. The present 
invention comprises a M D4B segment" isolated from other 
lovastat in-producing organisms. The arrangment of the' 

15 genes within the D4B segment may be different in other 

organisms. We predict that the genes within these other 
segments will have at least 80% homology, at the nucleic 
acid level, with the genes disclosed herein. We envision 
that each of these lovastat in-producing organisms will 

20 comprise within their genomes a LovA, LovB, LovC, LovD, 
LovE and LovF gene . 

We have determined that the D4B segment will confer 
production of monocolin J if the genes are all expressed, 
as we show below in an example using A . nidulans . We 

25 envision that adding the LovF gene to the D4B segment 
genes will result in the production of lovastatin in a 
non- lovastat in-producing organism. 
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Table 1, below, summarizes information regarding the 
different protein and nucleic acid sequences of the 
present invention.. SEQ ID NOs:l-17 are predicted 
translation products of various members of the gene 
cluster. SEQ ID NOs:18 and 19 are the entire DNA 
sequence of the gene cluster. SEQ ID NOs:21-36 are the 
genomic DNA sequences of the various members of the gene 
cluster and include the introns. These DNA sequences are 
reported in the Sequence Listing in the 5' - 3' 
orientation, although, as Fig. 1 indicates, some of these 
DNA sequences are in the inverted orientation in the 
actual cluster. 



TABLE 1 



SEQ ID NO. 


DESCRIPTION 


COMMENTS 


SEQ ID NO: 1 


Predicted amino acid sequence 
of ORF 1 /Esterase-like protein 


Translation of 6 EXONS 6865- 
6568, 6462-5584, 5520-4822, 4774- 
3511,3332-2372,2301-1813 
(reverse complement) 
FROM SEQ ID NO: 18 


SEQ ID NO: 2 


Predicted amino acid sequence 
ofORF2 


Translation of 1 EXON 7616-8602 
FROM SEQ ID NO: 18 


SEQ ID NO: 3 


Predicted amino acid sequence 
of LovA/P4501 protein 


Translation of 1 EXON 10951-9980 
(reverse complement) 
FROM SEQ ID NO: 18 


SEQ ID NO: 4 


Predicted amino acid sequence 
ofORF5 


Translation of 1 EXON 22760- 
21990 (reverse complement) 
FROM SEQ ID NO: 18 


SEQ ID NO: 5 


Predicted amino acid sequence 
of LovC/Dehydrogenase 


Translation of 3 EXONS 23158- 
23717, 23801-23912, 23991-24410 
FROM SEQ ID NO: 18 


SEQ ID NO: 6 


Predicted amino acid sequence 
of LovD/Esterase 


Translation of 3 EXONS 26203- 
26080, 26005-25017, 24938-24810 
(reverse complement) 
FROM SEQ ID NO: 18 
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SEQ ID NO. 


DESCRIPTION 


COMMENTS 


SEQ ID NO: 7 


Predicted amino acid sequence 
of ORF8/HMG Co A Reductase 


Translation of 5 EXONS 3006^- 

29882, 29803-29745, 29664-271 19 

27035-26779, 26722-26559 (reverse 

complement) 

FROM SEQ ID NO: 18 


SEQ ID NO: 8 


Predicted amino acid sequence 
of LovE/Zn Finger I 


Translation of 1 EXON 31360- 
32871 

FROM SEQ ID NO: 18 


SEQ ID NO: 9 


Predicted amino acid sequence 
of ORFIO/Metabolite transport 


Translation of 8 EXONS 1400- 
1452, 1619-1695. 1770-1996.2065- 
2088, 2154-2225, 2332-2865. 2939- 
3099, 3180-3560 
FROM SEQ ID NO: 19 


SEQ ID NO: 10 


Predicted amino acid sequence 
ofLovF/ScPKS 


Translation of 7 EXONS 4430- 
4627, 4709-4795, 4870-4927. 4985- 
5318, 5405-5912, 5986-6565. 6631- 
12464 

FROM SEQ ID NO: 19 


SEQ ID NO: 1 1 


Predicted amino acid sequence 
of ORF12 


Translation of 3 EXONS 13596- 
13496, 13451-13063, 12968-12709 
(reverse complement) 
FROM SEQ ID NO: 19 


SEQ ID NO: 12 


Predicted amino acid sequence 
ofORF13/Zn Finger II 


Translation of 5 EXONS 16608- 

16463, 16376-15572, 15519-15346, 

15291-14825, 14767-14131 (reverse 

complement) 

FROM SEQ ID NO: 19 


SEQ ID NO: 13 


Predicted amino acid sequence 
of ORF14/Acetyl CoA 
transport protein 


Translation of 7 EXONS 19642- 

19571, 19502-19427, 19352-19227, 

19158-19011, 18956-18663.18587- 

18438, 18380-18341 (reverse 

complement) 

FROM SEQ ID NO: 19 


SEQ ID NO: 14 


Predicted amino acid sequence 
ofORF15 


Translation of 2 EXONS 20332- 
20574, 20631-21860 
FROM SEQ ID NO: 19 


SEQ ID NO: 15 


Predicted amino acid sequence 
of ORF 1 6/Membrane transport 
protein 


Translation of 5 EXONS 24521- 

24054, 23996-23936, 23876-23184, 

231 1 1-22977, 22924-2281 8 (reverse 

complement) 

FROM SEQ ID NO: 19 


SEQ ID NO: 16 


Predicted amino acid sequence 
ofORF17/P450II protein 


Translation of 3 EXONS 28525- 
27673, 27606-27284, 2721 1-26837 
(reverse complement) 
FROM SEQ ID NO: 19 
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SEQ ID NO. 


DESCRIPTION 


COMMENTS 


SEQ ID NO: 17 


Predicted amino acid sequence 
of ORF18 (incomplete) 


Translation of 2 EXONS 29826- 
30995. 31054-31328 (incomplete) 
FROM SEQ ID NO: 19 


SEQ ID NO: 18 


DNA sequence of gene cluster- 
first 33,000 nucleotides 




CFO TT") NO- 1 9 


DNA <;pniif*Tirp of rlu^tpr- 

nucleotides 33,001-64,328 
(renumbered 1-31,328) 






ORFl/Esterase-like gene 


Start — 

<J v CXI I UUUJ 

Stop = 1813 
SEQ ID NO: 18 




uiNrv sequence oi uivr^ 


OlaTl — /DID 

Stop = 8602 
SEQ ID NO: 18 




gene 


Start = 1 HQS 1 

Stop = 9980 
SEQ ID NO: 18 




T""^NJ A cf*rt 1 1 **nr*f* rw f"M?FS 


Stop = 21990 
SEQIDNO:18 




l^in/a sequence oi 
LovC/Dehydrogenese 


Start = 1"K 1 SR 

Stop = 24410 
SEQ ID NO: 18 




u\\r\ sequence oi 
LovD/Esterase 


Stop = 26203 
SEQ ID NO: 18 


CPO TH MO- 96 

jLy 1 1NVJ. 


fiMA QAnnAnrp of OR FR/HMfi 

CoA Reductase 


Start = 30067 
Stop = 26559 
SEQ ID NO: 18 


Qpn TO NO- 77 


HMA cpni if*nr*f» ofT nvF/7n 

Finger I 


Start = 31360 
Stop = 32871 
SEQ ID NO: 18 


SFO ID NO* 7R 


ONI A Q**ni ipnpp of 

ORFIO/Metabolite transport 


Start = 1 400 
Stop = 3560 
SEQ ID NO: 19 


SEQ ID NO: 29 


DNA sequence of LovF/ScPKS 


Start = 4430 
Stop= 12464 
SEQ ID NO: 19 


SEQ ID NO: 30 


DNA sequence of ORF12 


Start = 13596 
Stop= 12709 
SEQ ID NO: 19 



-9- 



BNSDOCID: <WO 0037629A2 I > 




WO 00/37629 PCT/US99/29583 



SEQ ID NO. 


DESCRIPTION 


COMMENTS 


br:Q ID NU. 3 1 


n\! A conn ~ ^ ^ r /~\D 17 1 7/7 n 

uin/\ sequence ol UKr 1 5iLxi 
Finger II 


Ctnrt 1 /('An 

otan — l ooUo 
Stop = 14131 
SEQ ID NO: 19 


SEQ ID NO: 32 


DNA sequence of 
ORF14/Acetyl Co A transport 
gene 


Start = 19642 
Stop= 18341 
SEQ ID NO: 19 


SEQ ID NO: 33 


DNA sequence oi ORF15 


Start = 20332 
Stop = 21860 
SEQ ID NO: 19 


SEQ ID NO: 34 


DNA sequence of 

ORF 1 6/Membrane transport 

protein 


Mart — 24321 
Stop = 22818 
SEQ ID NO: 19 


OCA JJ~\ XJfV 7^ 

oty ID NO. J!> 


DNA sequence of 
ORF17/P450II gene 


Mart — ZojZd 
Stop = 26837 
SEQ ID NO: 19 


SEQ ID NO: 36 


DNA sequence of ORF 18 
(incomplete) 


Start = 29826 

to 31328 (incomplete) 

SEQ ID NO: 19 



Table 1 also notes the translation start and stop 
points in the various gene sequences . 

10 The sequence of the NPKS gene is not listed in SEQ 

ID NOs:21-36. This gene is characterized in U.S. patent 
5,744,350. However, SEQ ID N0s:18 and 19 do contain the 
sequence of the NPKS gene within the context of the 
entire gene cluster. 

15 To perform many embodiments of the present 

invention, one will need to recreate various genes or a 
portion of the gene cluster described herein. Applicants 
have provided sequence data in the Sequence Listing 
sufficient to allow one of skill in the art to construct 

20 numerous probes suitable to recreate the genes from an A . 
terreus genomic library. Applicants have also described 
below various methods for isolating A. terreus DNA. 
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Additionally, Applicants have deposited ATCC 
Accession No. ATCC 98876, which contains clone pWHM1263 
(cD4B) and ATCC Accession No. ATCC 98877 which contains 
clone pWHM1265 (CB2A4) . Both plasmids are described in 
more detail below. Fig. 4 describes clone 

CB2A4/pWHM1265 , and Fig. 6 describes clone CB4B/pWHM1263 . 
Fig. 1 also indicates the boundaries of the D4B and B2A4 
clones . 

The clones and their inserts may be prepared from 
the ATCC deposits by methods known to those of skill in 
the art . The DNA from the clones may be isolated and any 
gene within the gene cluster may be isolated and 
utilized . 

Increasing the Production of Lovastatin by Lovastatin- 
producing Fungi and Yeast 

In one embodiment, the present invention is a method 
of increasing the production of lovastatin in a 
lovastat in-producing fungi and yeast, preferably A. 
terreus ATCC20542 and ATCC20541. Other examples of 
suitable lovastatin-producing fungi and yeast are listed 
in Table 2, below. 
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Microorganisms other than A. terreus reported to produce lovastatin (mevinolin) 

Monascus (17 of 124 strains screened) species 1 

M. ruber 
M. purpureas 
M. pilosus 
M. vitreus 
M. pubigerus 

Penicillium sp.' 
Hypomyces sp. 
Doratomyces sp. 
Phoma sp. 
Eupenicillium sp. 
Gymnoascus sp. 
Trichoderma sp. 

Pichia labacensis 2 
Candida cariosilognicoia 

Aspergilus oryzea 3 
Doratomyces stemonitis 
Paeciiomyces virioti 
Penicillium citrinum 
Penicillin chrysogenum 
Scopulariopsis brevicaulis 
Trichoderma viride 



1. P. Juzlova, L. Martinkova, V. Kren. Secondary Metabolites of the fungus Monascus: a review. 
J. Ind. Microbiol. 16:163-170 and references cited therein (1996). 

2. N. Gunde-Cimerman, A. Plemenitas and A. Cimerman. A hydroxymethylglutaryl-CoA reductase 
inhibitor synthesized by yeasts. FEMS Microbiol. Lett. 132:3943 (1995). 

3. A.A. Shindia. Mevinolin production by some fungi. Folio Microbiol. 42:477-480 (1997). 



By "increasing the production" we mean that the 
amount of lovastatin produced is increased by at least 2- 
fold, preferably by at least 5-fold. The examples below 
35 demonstrate two preferred methods for analyzing strains 
for lovastatin production. In method A, the spore 
suspension is inoculated into a flask of SEED medium and 
grown. The resulting seed culture is used to inoculate 
FM media and grown for six days. In fermentation method 




WO 00/37629 



PCT/US99/29583 



10 



15 



20 



B, one inoculates 50 ml of RPM media and grows this 
larger culture for 7 days. 

Both cultures are extracted, pH adjusted, mixed with 
ethyl acetate and shaken for two hours. For analysis, 1 
ml of the ethyl acetate layer is dried under a nitrogen 
stream and resuspended in methanol. For TLC analysis, a 
small amount of the extract is run on C18 reverse phase 
TLC plates in a solvent system of methanol; 0.1% 
phosphoric acid. The TLC plates are developed by 
spraying with phosphomolybdic acid in methanol and 
heating with a heat gun. The extracts are compared with 
authentic lovastatin, monacolin J , monacolin L and 
dihydromonocolon L. 

If one wishes HPLC analysis, the examples below 
describe the use of a Waters Nova-Pak C18 column used 
with a solvent system of acetonitrile and phosphoric 
acid. A Waters 996 Photodiode Array Detector will detect 
the metabolites. Lovastatin was detected at 238 nm. 

In one embodiment, one would transform a lovastatin- 
producing fungi or yeast with the lovE/zinc finger I 
gene, preferably comprising the nucleotides of SEQ ID 
NO: 27. The examples below predict that this will result 
in an increase of at least 5-7 fold. Preferably, the 
increase will be at least 2.0-fold. 

One may also transform a lovastatin-producing fungi 
or yeast with the LovE gene isolated from other 
lovastatin-producing fungi or yeast. One may obtain this 
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gene by use of a probe derived from SEQ ID NO: 27 by 
methods known to those of skill in the art. 

One may also transform lovastat in-producing fungi 
and yeast with the D4B segment of the lovastatin 
5 production gene cluster (see Fig. 1), preferably as found 
in ATCC accession number 9S876 . Alternatively, one may 
transform lovastat in-producing fungi or yeast wich che 
entire gene cluster, as diagramed in Fig. 1: 

We envision that to successfully increase lovastatin 

10 production, one may also wish to transform less than the 
entire gene cluster. Preferably, one may determine what 
the smallest possible segment is by deleting various 
portions of the gene cluster and determining whether 
lovastatin production is continually increased. 

15 Similarly, if one begins with the D4B segment, one may 
delete various portions for the segment and determine 
whether lovastatin production is continually increased by 
at least 2-fold. 

Modification of the LovB/NPKS gene would produce 

20 other HMG CoA inhibitors. For example, Fig. 3 diagrams 
the relationship between mevastatin, lovastatin, 
simvastatin and pravastatin. In one example, the methyl 
transferase domain of the NPKS gene may be replaced with 
an inactive form to make pravastatin. The HMG - CoA 

25 reductase inhibitors within this invention include, but 
are not limited to, compactin (ML-236B) , lovastatin, 
simvastatin, pravastatin and mevastatin. 
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In another embodiment of the present invention, one 
may transform a lovastat in-producing organism with the 
genes described above and obtain the production of an HMG 
CoA reductase inhibitor with a structure different from 
5 monacolin J, monacolin L or lovastatin. Alterations in 
the side chain attached to C8 are the most likely 
possibility but other alterations may occur. These 
alterations would happen through the native biochemistry 
of the organism. 

10 If one wishes to express the A. terreus genes in 

yeast, one may wish to consult examples in which others 
have engineered fungal secondary metabolism genes for 
expression in yeast. (See for example, J. T. t Kealey, 
al . , Proc . Natl . Acad. Sci . USA 95:505-509 (1998) ) . The 

15 exact approach could be used with the NPKS (LovB) and 

ScpKS (LovF) genes, and a somewhat simpler approach with 
the other lovastatin genes in their cDNA forms. 

Production of HMG -CoA Reductase Inhibitors by Fungi and . 
Yeast That Do Not Natively Produce Inhibitors . 

20 In another embodiment, the present invention is the 

production of HMG-CoA reductase inhibitors, such as 
lovastatin, by fungi and yeast that do not natively 
produce lovastatin. An example of a suitable fungi or 
yeast is A . nidulans and S. cerevisiae, respectively. 

25 For this embodiment one preferably transforms the 

genes within the D4B segment into the non- inhibitor- 
producing strain. By this method, one would produce 
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monacolin J (See Fig. 2) which could be chemically 
converted to lovastatin by one of skill in the art. 

Monacolin J, in its lactone form obtained by 
treatment with anhydrous acid under dehydrative 
conditions, is preferably treated with a derivative of 
(2S) -2 -methybutyric acid, in which the carboxyl group has 
been suitable activated for undergoing esterif ication, 
and the resulting lovastatin is isolated by conventional 
methods. For example, see WO 33538, U.S. patent 
4,444,784 and Med. Chem. 29:849 (1986) . These are 
citations for synthesis of simvastatin from monacolin J. 
One would use the same method, but use the (2S)-2- 
methylbutyrate derivative to make lovastatin. 

In another embodiment of the present invention, one 
would transform the genes within the D4B segment, 
including an entire LovF/SCPKS gene, into the non- 
inhibitor-producing organism. By this method, one would 
produce lovastatin in a non- lovastat in-producing 
organism. 

In another embodiment of the present invention, one 
may transform a non- lovastat in-producing organism with 
the genes described above and obtain the production of an 
HMG CoA reductase inhibitor with a structure different 
from monacolin J, monacolin L or lovastatin, as described 
above . 

Modification of the LovB/NPKS gene would produce 
other inhibitors. For example, Fig. 3 diagrams the 
relationship between mevastatin, lovastatin, simvastatin 
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and pravastatin. In one example, the methyl transferase 
domain of the NPKS gene may be replaced with an inactive 
form to make pravastatin. The HMG-CoA reductase 
inhibitors within this invention include, but are not 
limited to, compactin (ML-236B) , lovastatin, simvastatin, 
pravastatin and mevastatin. 

Production of Intermediate Materials 

In another embodiment, the present invention is a 
method of isolating intermediate materials in the 
production of lovastatin and analogs such as mevastatin 
and simvastatin. For example, the Examples below 
demonstrate the disruption of the lovastatin projection 
gene cluster with mutagenized LovC, LovD, LovF, LovA or 
LovB genes. Disruption of many of these genetic elements 
of the lovastatin production gene cluster will result in 
accumulation of intermediate materials. Therefore, to 
practice this embodiment of the present invention, one 
would transform a suitable lovastat in-producing host with 
a mutagenized gene within the D4B segment, as described 
below . 

Many other mutations would be suitable to destroy 
the function of LovC, LovD, LovF, LovA or LovB. All that 
is necessary is these genes be disrupted to the extent 
that they are non- functional . 

Production of Lovastatin Analogs 

In another embodiment, the present invention 
provides a method for engineering the production of 
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lovastatm analogs in such organisms as fungi or yeast, 

using monacolin J as the starting point. 

Isolated DNA Segments 

In another embodiment, the present invention is a 
5 DNA segment capable of conferring lovastatin or monacolin 
J production or increase in lovastatin or monacolin J 
production in yeast or fungi. In a preferred example, 
this segment is the "D4B segment" that is deposited at 
ATCC 98876. The nucleotide sequence of this segment is 
10 found in residues 579 - 33,000 of SEQ ID NO: 18 and 
residues 1 - 5,349 of SEQ ID NO: 19. 

In another embodiment, the present invention is the 
entire A. terreus lovastatin gene cluster, as - exemplified 
by SEQ ID NOs:18 and 19 and ATCC deposits 98876 and 
15 98877. 

The present invention is also the individual genes 
that make up the A. terreus lovastatin gene cluster. 
Therefore, the present invention is a nucleic acid 
segment selected from the group of consisting of SEQ ID 

20 NOs:20 - 36. Preferably, the present invention is the 
coding region found within SEQ ID NOs:20 - 36 and 
described in Table 1. The present invention is also a 
mutagenized version of SEQ ID NOs:22, 24, 25 and 29, 
wherein the gene is mutagenized to be non- functional in 

25 terms of lovastatin or monacolin J production. 
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Organisms with Inrrp ased Lovastatm or Monacolin J 
Production 

In another embodiment, the present invention are the 
organisms described above. These organisms include 
5 lovastatin-producing organisms, preferably yeast and 

fungi, that have been engineered to display at least a 2- 
fold increase in lovastatin or monacolin J production. 
The organisms also include non- lovastatin-producing 
organisms, preferably yeast or fungi, that have been 
10 engineered to produce monacolin J or lovastatin. 

Ant i fungal Compounds 

Applicants note that lovastatin, monocolin J, 
monocolin L and dihydromonocol in L all have varying 
degrees of antifungal activity. Applicants envision that 

15 the present invention is also useful for providing 

antifungal compounds and organisms engineered to express 
antifungal compounds. Preferably, one would measure the 
antifungal properties of .a compound in the manner of N. 
Lomovskaya, et al . , Microbiology 143:875-883, 1997. 

2 0 Measurement of inhibition of yeast growth can be found in 
R. Ikeura, et al., sL Antibiotics 41:1148, 1988. The 
same general methods could be used for all fungi. Both 
of these references are hereby incorporated by reference. 
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EXAMPLES 

1 . General Methods and Procedures 

Construction of an A . terreus ATCC20542 genomic library. 

A, terreus ATCC20542 genomic DNA was partially 
5 digested with Sau3AI so as to produce an average fragment 
size of 40 - 50 kb . The partially digested genomic DNA 
was then separated on a sucrose gradient and the 40 - 50 
kb fraction was collected. Cosmid AN26 (Taylor and 
Borgmann, Fungal Genet . Newsletter 43, 1996) was prepared 

10 by digestion with Clal, dephosphorylated with CIP, then 
digested with BamHI to create the two cosmid arms. 
Ligation reactions with genomic DNA fragments and cosmid 
arms were optimized and packaged using Gigapack III XL 
packaging extract (Stratagene) . The packaged cosmid 

15 library was infected into E. coii JM109 and plated out 
onto LB agar (Sambrook, e_t a_l. , Molecular Cloning. A 
Laboratory Manual . 2nd ed. Cold Spring Harbour 
Laboratory Press, 1989; other standard methods used can 
be found here also) with ampicillin (50 /ig/ml) plates. 

20 After checking for the presence of insert DNA in a 

selection of clones, 5000 colonies were picked into LB 
plus 50 fig/ml ampicillin filled microtitre plates and 
grown overnight at 37 C C. The colonies were replica 
plated onto nylon membranes (Amersham Hybond-N) . 

25 Glycerol was added at a final concentration of 15% (v/v) 
to the microtitre plates and these were stored at -70°C. 
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Isolation of genomic clones containing the lovastatin 
biosynthesis cluster . 

A 2.8 kb EcoRI fragment from pTPKSlOO containing 
part of the NPKS gene (Vinci, et al., U.S. Patent No. 
5,744,350) was gel-isolated and labelled with digoxigenin 
using the Genius Kit II (Boehringer Mannheim) . This 
labelled fragment was hybridized (65°C, 5x SSC) with the 
nylon membranes containing the A. terreus genomic 
library, then washed (65°C, 0 . Ix SSC) . Two positive 
clones were identified, pWHM1263 (cD4B) and pWHM1264 
(cJ3A) . Two of these clones, pWHM1263 (cD4B) and 
pWHM1265 (cB2A4) , have been deposited in the ATCC 
(American Type Culture Collection, 10801 University 
Boulevard, Menassas, VA 20110) at accession number ATCC 
98876 and 98877, respectively, under the terms and 
conditions of the Budapest Treaty. The presence of the 
NPKS gene was confirmed initially by restriction 
digestion and later by DNA sequencing. 

Overlapping clones were found by repeating the 
hybridization process using labelled fragments from both 
ends of the insert in pWHM1263 . This resulted in the 
isolation of pWHM1265- 1270 (cB2A4, cL3E2 , cJ3B5, C02B5, 
CR3B2, cW3Bl) from downstream of the NPKS gene and 
pWHM1271 (cQIFl) from upstream of NPKS. All these clones 
were transformed into E. coli strain STBL2 (Stratagene) 
to help prevent rearrangements. 

Fig. 4 is a diagram of the cB2A4/pWHM1265 clone. 
This clone contains an insert of approximately 43 kb in 
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AN26 and includes the nucleotide sequence from at least 
nucleotides 4988 of SEQ ID NO: 19 to nucleotide 31,328 of 
SEQ ID NO: 19 and lO - 14 kb of uncharacterized DNA . Fig 
6 is a schematic diagram of cD4B/pWHM12 63 . This clone 
5 contains a 37,770 bp insert in AN26 and contains 

nucleotides 579 - 33,000 of SEQ ID NO: 18 and nucleotides 
1 - 5,349 Of SEQ ID NO:19. 
Sequencing strategy and analysis. 



10 pWHM1415) were constructed in pSPORTl (Gibco-BRL) and 
pGEM3 (Promega) . Plasmid DNAs for sequencing were 
prepared using the QiaPrep spin miniprep kit (Qiagen) . 
Cycle sequencing was carried out using the AmpliTaq FS or 
BigDye reagents ( ABI ) and were analyzed using a ABI model 

15 373 or 377 DNA Sequencer. Primer walking was performed 

by synthesis of 18-22 bp oligonucleotide primers based on 
the sequenced DNA strand, with the help of the Oligo 4.05 
program (National Biosciences, Inc.). Every region of 
DNA was sequenced at least once on both strands. Direct 

20 sequencing of cosmids and PCR products was used to 

confirm adjoining regions where no overlapping clones 
existed. DNA sequence analysis and manipulations were 
performed using SeqMan ( DNAS TAR) and SeqEd (ABI) 
software. Assignments of putative ORFS , including 

25 putative introns, were performed with the aid of BLAST 

2.0 searches (Atschul, et al . , Nucl . Acids Res . 25:3389- 
3402, 1997), and the Genetics Computer Group (GCG) 
programs (Program Manual for the Wisconsin Package, 



A series of overlapping subclones (pWHM1272- 
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Version 8, September 1994, Genetics Computer Group, 
Madison, WI ) , version 8.1. 

Isolation and characterization of lovF (ScPKS, ORF11) , 
lovD (EST1, ORF7), lovC (DH, ORF6), and lovA (P450I, 
5 ORF3) mutants. 

lovF 

To disrupt the polyketide synthase gene, lovF, a 1.7 
kb EcoRI fragment internal to the lovF gene was subcloned 
from pWHM1265 into pSPORTl to give pWHM12 91 . The ScPKS 

10 fragment was then subcloned from this vector, as an 

Acc65I - Hindi I I fragment, into pPLOA (Vinci, e£ al . , 
U.S. Patent No. 5,744,350) to give pWHM1416. This vector 
contains the phleomycin (Zeocin, obtained from 
InVitrogen) resistance gene for selection in ^. terreus . 

15 A. terreus ATCC20542 was then transformed to Zeocin 
resistance with this plasmid as described below. 
Transf ormants were screened for lovastatin production as 
described below (Method A) . In one of the transf ormants , 
WMH1731, lovastatin production was abolished and a new 

2 0 compound accumulated. This new compound comigrated with 
monacolin J on TLC and HPLC according to the methods 
described below. Semi-preparative HPLC was used to 
partially purify the major product which was then 
analyzed by HPLC - MS. The same mass and fragmentation 

25 pattern as authentic monacolin J was observed. To 

confirm the disruption of the lovF gene, total genomic 
DNA was prepared from wild- type A. terreus ATCC20542 and 
the WMH1731 mutant strain. The genomic DNA was digested 
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with Ba/nHI and Hindlll, electrophoresed on an agarose gel 

and capillary blotted onto a nylon membrane. The 
membrane was hybridized with the 1.7 kb EcoRI fragment 
from pWHM1416 labelled using the Genius II kit 
5 (Boehringer Mannheim) using the conditions described 

previously. The wild-type strain had hybridizing bands 
at 4.2 kb for BamHI and 11.5 kb for Hindlll. As 
predicted, the WMH1731 mutant strain had hybridizing 
bands at 6.5 kb and 2.2 kb for BairiHI and 11 kb and 7.8 kb 
10 for Hindi I I confirming the homologous integration of a 
single copy of pWHM14I6 at the lovF locus. 
lovD 

To disrupt the putative esterase/carboxypept idase- 
like gene, lovD, a 4.8 kb NotI - EcoRI fragment from 

15 pWHM1263 was subcloned into pSPORTl to give pWHM12 74 . 

This plasmid was digested with Hindi I I and SsiWI and a 
1.8 kb fragment was isolated. The plasmid was also 
digested with Hindi I I and BarriHI and the 6.6 kb fragment 
was isolated. pPLOA was digested with Ba/nHI and Acc65I 

20 and the 2.1 kb fragment containing the phleomycin 

resistance marker was purified. These three fragments 
were ligated together and used to transform competent E. 
coli cells. The expected plasmid, pWHM1417, containing 
the phleomycin resistance gene flanked by the beginning 

25 and the end of the lovD gene was isolated. This plasmid 
was linearized by digestion with Xbal or RsrII before 
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being used to transform A. terreus ATCC20542 to Zeocin 

resistance. Transf ormant s were screened for lovastatin 

production as described below (Method A) . In one of the 

transf ormants , WMH1732, lovastatin production was 

5 abolished and a new compound accumulated. This new 

compound comigrated with monacolin J on TLC and HPLC 

according to the methods described below. Semi- 

preparative HPLC was used to partially purify the major 

product which was then analyzed by HPLC - MS. The same 

10 mass and fragmentation pattern as authentic monacolin J 
was observed. To confirm the disruption of the lovD 
gene, total genomic DNA was prepared from wild type A. 
terreus ATCC20542 and the WMH1732 mutant strain. The 
genomic DNA was digested with Apal , run out on an agarose 

15 gel and capillary blotted onto a nylon membrane. The 
membrane was hybridized with the 4.8 kb No tl - EcoRI 
fragment from pWHM1274 labelled using the Genius II kit 
using the conditions described previously. The wild-type 
strain had hybridizing bands at 9 kb, 8 . 4 kb and 1.5 kb . 

2 0 As predicted the mutant strain had hybridizing bands at 9 
kb, 8 kb, 3 kb and 1.5 kb confirming the homologous 
integration of a single copy of pWHM1417 at the lovD 
locus . 
lovA 

25 To disrupt the cytochrome P450 I gene, lovA, an 11 

kb Acc65I - EcoRI fragment from pWHM1263 was subcloned 
into pGEM3 to give pWHM1272 . From this plasmid a 2.1 kb 
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Apal - SnaBI fragment was purified and ligated to Apal - 

EcoRV digested pPLOA to give p450Phleo (pWHM1418) . From 

this plasmid a 4.2 kb Apal - Notl fragment was purified 

and ligated with a 1.8 kb EagI - Kpnl fragment from 

5 pWHM12 72 and Apal - Kpnl digested pGEM7 to give 

p450Dphleo (pWHM1419) which contains the lovA gene 

disrupted by the phleomycin resistance gene. This 

plasmid was then digested with Kpnl and Apal and the 

resulting fragment was used to transform A. terreus 

10 ATCC20542 to Zeocin resistance. Transf ormant s were 

screened for lovastatin production as described below 
(Method A). In one of the transf ormant s , WMH1733, 
lovastatin production was abolished and two new compounds 
accumulated. Genomic DNA was prepared from this strain 

15 and from A. terreus ATCC20542, digested with EagrI , run 

out on an agarose gel, and capillary blotted onto a nylon 
membrane. The membrane was hybridized with the 6 kb Apal 
- Kpnl fragment from pWHM1419 labelled using the Genius 
II kit using the conditions described previously. The 

20 wild-type strain had hybridizing bands at 2.0 kb, 1.9 kb 
and 1.1 kb. Mutant strain WMH1733 had hybridizing bands 
at 2.5 kb, 2.0 kb, l.l kb and 0.7 kb confirming the 
homologous integration of a single copy of the fragment 
from pWHM1419 at the lovA locus. 
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lovC 



To disrupt the dehydrogenase- like gene, lovC, a 2 kb 
EcoRI - Bglll fragment from pTPKSlOO was ligated with a 
1.7 kb EcoRI - SacI fragment from pWHM1274 and Bglll - 
5 SacI digested litmus 28 (New England Biolabs) to produce 
pDHl (pWHM1420) . Another plasmid pDH2 (pWHM1421) was 
constructed from a 2.2 kb AccSSl - Sacl fragment from 
pWHM1274, a 2 . 1 kb Hindi I I - SacI fragment from pPLOA 
containing the phleomycin resistance gene and Hindi I I - 

10 Acc65I digested litmus 28. The disruption vector pDH-dis 
(pWHM1422) was constructed by ligating together a 2.5 kb 
Bglll - Hpal fragment from pWHM1420, a 4 . 3 kb EcoRV - 
Kpnl fragment from pWHM1421 and Bglll - Kpnl digested 
litmus 28. This plasmid was digested with Bglll and Kpnl 

15 and the resulting 6.8 kb fragment was used to transform 

A . terreus ATCC2 0542 to Zeocin resistance. Transf ormant s 
were screened for lovastatin production as described 
below (Method A) . In two of the transf ormant s , WMH1734 
and WMH1735, lovastatin production was abolished. 

20 Genomic DNA was prepared from these strains and from A. 
terreus ATCC20542, digested with EagI , run out on an 
agarose gel, and capillary blotted onto a nylon membrane. 
The membrane was hybridized with the 6.8 kb Bgl II- Kpnl 
fragment from pWHM1422 labelled using the Genius II kit 

25 using the conditions described previously. The wild type 
strain had hybridizing bands at 5 kb, 1.5 kb and 1.3 kb. 
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Mutant: strain WMH1734 had hybridizing bands at 4 . 9 kb, 
1.3 kb, 1.0 kb and 0.7 kb confirming the homologous 
integration of a single copy of the fragment from 
pWHM1422 at the lovC locus. The other mutant strain, 
5 WMH1735, had a similar banding pattern but with 

additional hybridizing bands indicating that multiple 
integration events had occurred, one of which was at the 
lovC locus. 

Construction and characterization of the A . terreus 
10 strain with extra copies of lovE . 

A 10.4 kb NotI- EcoRI fragment containing the 

putative regulatory gene, lovE was subcloned from 

pWHM1263 to pSPORTl to give pWHM12 76 . From this plasmid 

a 3.9 kb Hindi 1 1 - Ba/nHI fragment was subcloned into 

15 pGEM7 to give pWHM142 3 . The regulatory gene was 

subcloned from this vector into pPLOA as an SstI - XbaRI 
fragment to give pWHM1424 (Fig. 5) . pWHM1424 contains 
nucleotides 30,055 - 33,000 from SEQ ID NO: 18 and 
nucleotides 1 - 1,026 from SEQ ID NO: 19. 

20 Extra copies of the regulatory gene were introduced 

into A . terreus ATCC20542 by transformation to Zeocin 
resistance with pWHM1424 . Transf ormant s were fermented 
(method A) and screened for lovastatin production 
initially by TLC analysis. Most of the transf ormants 

25 appeared to be producing significantly more lovastatin 

than the wild- type strain. The yields of lovastatin from 
the two transf ormant strains, WMH1736 and WMH1737, which 
had the most elevated levels compared to the wild-type 
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was quantified by HPLC as described below. These were 

found to produce 7- fold and 5- fold more lovastatin chan 

the A . terreus ATCC20542 strain. 

Because of the way that the DNA integrates 

5 (ectopically) , each transf ormant is or can be unique, 

genotypically and phenotypically . However, some will be 

overproducers ; others may exhibit no difference, for 

unknown reasons . 

Heterologous expression of the lovastatin biosynthesis 
10 genes . 

To place the NPKS gene (lovB) under the control of 
the inducible alcA promoter, the 11.5 kb Kpnl - Avrll 
fragment from pTPKSlOO containing the NPKS open reading 
frame was ligated into pAL3 (Waring, et al. , Gene 79:119, 

15 1989) previously digested with fCpnl and XJbal . The 

resulting plasmid was designated pAL3TPKS (WHM1425) . The 
polymerase chain reaction was used to amplify the NPKS 
gene sequence between the NPKS promoter region just 
upstream of the t ranslat ional start codon and a Agel site 

20 internal to NPKS. The design of the forward primer 

introduced a Kpnl site 31 bases from the translat ional 
start codon allowing the NPKS to be placed against the 
alcA promoter but also incorporating upstream elements 
from the A . terreus system. Amplification was performed 

25 using Vent DNA polymerase with pTPKSlOO as template and 1 
/xmol of each primer in a final volume of 100 fil using the 
manufacturer's buffer recommendations. After an initial 
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denaturation cycle of 10 minutes at 95°C amplification 

was achieved with 30 cycles of 95°C for 1 minute; 55 C C for 

1 minute and 72 °C for 1.5 minutes. The final cycle was 

followed by 10 minutes at 72 °C to ensure complete 

5 polymerization. The amplified product (1.7 kb) was 

digested with Kpnl and Agel and ligated into pWHM1425 

that had been digested with the same enzymes and gel 

isolated. The resulting plasmid was designated 

pAL3TPKSNT (pWHM1426) . The region introduced by PCR was 

10 sequenced on a ABI automated DNA sequencer to ensure 
sequence fidelity. This plasmid was then used to 
transform A, nidulans strain Kill (Fungal Genetics Stock 
Centre) to uridine prototrophy. 

Transf ormant s were grown by inoculating 0.5 ml of 

15 spore suspension (10 s c.f.u./ml) in 50 ml YEPD in a 250 ml 
unbaffled flask. This was then grown for 20 hours at 250 
rpm and 37°C (New Brunswick Scientific Series 25 
Incubator Shaker) . The mycelia were then harvested by 
filtration through Miracloth (Calbiochem) , rinsed with 

20 sterile, distilled water, and inoculated into fresh 250 
ml unbaffled flasks containing 50 ml AMM + lactose + 10 
mM cyclopentanone and grown for a further 2 0 hours under 
the same conditions. The mycelia were harvested by 
filtration using Miracloth (Calbiochem) , squeezed as dry 

25 as possible and frozen in liquid nitrogen. Protein 

extracts for SDS-PAGE and western analysis were prepared 
as described in Kennedy and Turner, Molec , Gen . Genet . 
(1996), 253:189-197, 1996. 
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One transf ormant , WMH173 8, was shown to have a large 

protein (>2.00 kDa) visible on a SDS-PAGE gel that cross 

reacted with the affinity purified NPKS antibodies 

(Panlabs). This strain WMH1738 was transformed to 

5 hygromycin B resistance with pWHM1263 . Transf ormant 

colonies were screened for lovastatin resistance and for 

the production of new metabolites as described below and 

two strains WMH1739 and WMH1740 were chosen for further 

analysis. Both of these ' strains were found to be 

10 significantly resistant (up to 100 ^g/ml on solid media) 
to lovastatin compared with the host strain. This was 
analyzed by streaking 10 fil of a spore suspension on 
solid AMM plates containing lovastatin at 0, 0.1, 0.5, 1, 
5, 10, 50 and 100 /ig/ml and incubating at 37°C. Strains 

15 WMH1739 and WMH1740 were compared to strains WMH1741 and 
WMH1742 which were derivatives of WMH1738 transformed to 
hygromycin resistance with AN26 . Strains WMH1739 and - 
174 0 exhibited no inhibition of growth at any of these 
lovastatin concentrations whereas strains WMH1741 and - 

20 1742 showed slight inhibition of grown at 5 /ig/ml and 

almost complete growth inhibition at 50 /ig/ml. The two 
lovastatin resistant strains were fermented in 
lovastat in-producing conditions using fermentation method 
B and extracts were analyzed for lovastatin related 

25 metabolites as described below. Both strains were found 
to produce new metabolites. One compound that was common 
to both comigrated with monacolin J on TLC and HPLC 
analysis by the methods described below. Semi- 
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preparative HPLC was used co partially purify some of 

this compound, which was then analyzed by HPLC -MS. It 
had the same mass and fragmentation pattern as authentic 
monacolin J. The other compound, found in only one of 
5 the strains, comigrated with monacolin L on TLC and HPLC. 
METHODS 

Solid medium for growth of A. terreus 

For the generation of spore suspensions A. terreus 

strains were grown on CM agar at 3 0°C for 4 to 5 days. 

10 CM Agar (for CM liquid medium the agar was omitted) : 
50 ml Clutterbuck' s salts (Vinci, et a],., U.S. 

Patent No. 5,744,350) 

2 ml Vogel's trace elements (Vinci, et aJL-, U.S. 
15 Patent No. 5,744,350) 

0.5% Difco Bacto tryptone 

0.5% Difco Bacto yeast extract 

1% glucose 

2% Difco Bacto agar 
20 in 1 liter of distilled water 

Clutterbuck' s salts : 

12% NaNG 3 
1.02% KC1 
1.04% MgS0 4 . 7H 2 0 
2 5 3.04% KH 2 P0 4 

Vogel's trace elements: 

0.004% ZnCl 2 
0.02% FeCl 3 
0.001% CuCl 2 
30 0.001% MnCl : -4H 2 0 

0.001% Na 2 B 4 0 7 • 1 0H 2 O 

0 . 001% (NH. ) e Mo 7 0 2 - • 7H 2 0 

For long term storage A . terreus spores were 
suspended in SSS ( 10% glycerol , 5% lactose) and stored at 
35 -70°C. 
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For the generation of spore stocks A. nidulans was 

grown on the following solid growth medium (ACM) for 3 ~o 
4 days at 37°C. 
ACM : 

5 2% Difco Bacto malt extract 

0.1% Difco Bacto peptone 
2% glucose 

2% agar (Difco, Detroit, MI) 

For strains which required para-aminobenzoic acid 
10 (PABA) for growth, PABA was added to a final 

concentration of 1 pg/ml . For strains which required 
uracil and uridine these were added at 20 mM and 10 mM, 
respectively. Spores were suspended in Tween 80 - saline 
solution (0.025% Tween 80, 0.8% NaCl) and stored at 4~C. 
15 AMM : 

0.6% (w/v) NaN0 3 
0.052% (w/v) KC1 
0.152% (w/v) KH.PO, 
0.052% (w/v) MgSO, - 7H.O 
20 1% (w/v) glucose 

0.1% (v/v) AMM trace elements solution 

pH to 6.5 and make up to 1 liter with distilled 

water . 

For preparation of plates 2% (w/v) Difco Bacto agar 
25 was added. If required the glucose can be omitted and an 
alternative carbon source (e.g., lactose added at the 
same concentration) . For the preparation of 
transformation plates KCl was added at 4.47% (w/v) (0.6 
M) . 

30 AMM trace elements solution: 

0.1% (w/v) FeS0 4 -7H 2 0 
0.88% (w/v) ZnS0 4 -7H 2 0 
0.04% (w/v) CuSO, • 5H 2 0 
0.015% (w/v) MnS0 4 -4H 2 0 
35 0.01% (w/v) Na 2 B 4 0 7 • 10H 2 O 
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0.005% ( NH« ) € Mo 7 0 24 * 7H 2 0 
distilled wacer to 1 liter 

Large scale genomic DNA preparation from A. terreus for 
genomic library construction. 

5 A 2.5 ml aliquot of spore suspension (10 s c.f.u./ml) 

was used to inoculate 500 ml of liquid CM medium and 

grown for 2 0 hours at 3 0°C and 20 0 rpm. The mycelium was 

harvested by filtration through Miracloth (Calbiochem) 

and rinsed extensively with water then TSE [150 mM NaCl, 

10 100 mM Na 2 EDTA, 50 mM Tris-HCl pH " 8 . 0 ] . The mycelium was 
squeezed dry, broken into small pellets and frozen in 
liquid nitrogen then ground to a fine powder in a pre- 
chilled pestle and mortar followed by transferral to a 
500 ml flask. Fifty ml of extraction buffer [150 mM 

15 NaCl, 100 mM Na 2 EDTA , 50 mM Tris-HCl pH 8.0, 2% (w/v) SDS] 
and 10 ml of toluene was added to the flask which was 
shaken at 60 rpm for 72 hours. This mixture was 
centrifuged at 1000 x g for 15 minutes and the 
supernatant was removed and extracted with an equal 

20 volume of chloroform : isoamyl alcohol (24:1 vol /vol ) . 

This mixture was centrifuged at 10,000 x g for 30 minutes 
at 15°C. The aqueous layer was carefully removed and 1.1 
volumes of ethanol was layered on top. The DNA was 
spooled out from the resulting suspension and resuspended 

25 in 5 ml TE [10 mM Tris-HCl pH 8.0, 1 mM EDTA] + 50 fig /ml 
RNase and 100 /ig/ml proteinase K then incubated at 37°C 
for 2 hours. The mixture was extracted again with 
chloroform: isoamyl alcohol (24:1) and the DNA was spooled 
out as before. Following resuspension in 1 ml of TE the 
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DNA was extracted once with phenol : chloroform : isoamyl 
alcohol (25:24:1, vol/vol), once with chloroform : isoamyl 
alcohol (24:1) and precipitated with 0.6 volumes 
isopropanol . The DNA clot was removed, dried briefly and 
5 resuspended in 0.5 ml TE . 

Small scale genomic DNA preparation from A . terreus for 
Southern blot. 

A 0.5 ml aliquot of spore suspension (10 a c.f .u./ml) 

was used to inoculate 100 ml of liquid CM and grown for 

10 20 hours at 30°C and 200 rpm. The mycelium was harvested 
by filtration through Miracloth (Calbiochem) and rinsed 
extensively with water then TSE [150 mM NaCl , 100 mM 
Na 2 EDTA, 50 mM Tris-HCl pH 8.0] . The mycelium was 
squeezed dry, broken into small pellets and frozen in 

15 liquid nitrogen. The mycelium was ground to a fine 

powder in a pre-chilled pestle and mortar and transferred 
to a mortar pre-heated to 65°C. Three ml of lysis buffer 
[0.5 M NaCl, 10 mM Tris-HCl pH'7.5, 10 mM EDTA, 1% (w/v) 
SDS] at 65°C was added and 0.3 ml of 10% (w/v) 

20 cetyl trimethylammonium bromide in 0.7 M NaCl . After 
thorough mixing to form a slurry, 3 ml of 
phenol : chloroform : isoamyl alcohol (25:24:1) was added. 
This mixture was transferred to a Corex tube and 
incubated at 65°C for 15 minutes. Following 

25 centrif ugation at 12,000 x g for 15 minutes at 4°C the 

aqueous phase was carefully removed and re-extracted once 
with phenol, once with phenol : chloroform : isoamyl alcohol 
(25:24:1) and once with chloroform : isoamyl alcohol 
(24:1) . The DNA was precipitated from the extract by 



-35- 



3NSDOCID: <WO 0037629A2 I > 



WO 00/37629 



PCT/US99/29583 



addition of 0.1 volume of 3 M sodium acetate pH 5 and 0.6 
volumes isopropanol then collected by cent ri fugat ion 
(10,000 x g, 10 minutes, 4°C). After washing with 70% 
ethanol the pellet was briefly dried and resuspended in 
5 TE + RNase (50 /ig/ml ) . 

Transformation of -A. terreus . 

A 0.5 ml aliquot of spore suspension (10 s c.f.u./ml) 
was used to inoculate 100 ml of liquid CM and grown for 
20 hours at 30°C and 200 rpm. The mycelium was harvested 

10 by centrif ugation at 2000 x g for 15 minutes at 4 C C and 
washed twice with an aqueous solution containing 0.27 M 
CaCl 2 and 0.6 M NaCl . To produce protoplasts the washed 
mycelia was resuspended in 20 ml of the same solution 
containing 5 mg/ml Novozym 234 (NovoNordisk) and 

15 incubated at 30°C for 1-3 hours with gentle agitation. 
Protoplasts were separated from undigested mycelia. by 
filtration through Miracloth (Calbiochem) . The 
protoplast suspension was diluted with an equal volume of 
STC1700 [1.2 M sorbitol, 10 mM Tris-HCl pH 7.5, 35 mM 

20 NaCl] and incubated on ice for 10 minutes. The 

protoplasts were collected by centrif ugation (2000 x g, 
10 minutes, 4°C), washed with STC1700 and resuspended in 
1 ml STC1700. Plasmid DNA, purified using Qiagen 
columns, (2 - 5 in 10 /il) was added to 150 /il of 

25 protoplast suspension- and incubated at room temperature 
for 25 minutes. PEG solution [60% (w/v) polyethylene 
glycol 4000, 50 mM CaCl 2/ 10 mM Tris-HCl pH 7.5] was added 
to the DNA/protoplasts mixture in three steps: 250 /il, 
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250 fil t and 850 fil with mixing afcer each addition. The 
suspension was incubated at room temperature for 25 
minutes then diluted to 10 ml with STC1700. Protoplasts 
were collected by centrif ugat ion as above and diluted 
5 with 500 fil STC1700. 100 fil aliquots of this mixture 

were plated onto osmotically stabilized plates [CM medium 
containing 3% (w/v) Dif co Bacto agar and 23.4% (w/v) 
mannitol, 15 ml of agar per plate] . After 4 hours growth 
at 30°C / 25 ml of OL agar [1% (w/v) Difco Bacto peptone, 

10 1% (w/v) Difco Bacto agar, 200 fig /ml Zeocin] was 

overlayered onto each dish. The plates were incubated 
for 3-4 days at 30°C before transformant colonies were 
picked. These were streaked to single colonies twice on 
selective media (CM + 100 fig /ml Zeocin) before spore 

15 suspensions were prepared. 

Transformation of A . nldulans * 

A 0.5 ml aliquot of spore suspension (10 8 c.f.u./ml) 
was used to inoculate 100 ml of YEPD [2% (w/v) Difco 
Bacto yeast extract, 2% (w/v) glucose, 0.1% Difco Bacto 

20 peptone] liquid medium including necessary supplements 

and grown for 20 hours at 37°C and 200 rpm. The mycelia 
was harvested by cent ri f ugat ion (2000 x g # 10 minutes, 
4°C) and washed twice with 0.6 M KC1 . To generate 
protoplasts the mycelia was resuspended in 20 ml of 0.6 M 

25 KC1 containing 5 mg/ml Novozym 234 and incubated at 30°C 
for 1-2 hours with gentle shaking. Protoplasts were 
separated from undigested mycelia by filtration through 
Miracloth (Calbiochem) . The protoplasts were harvested 
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by centrif ugat ion as described above and washed twice 

with 0.6 M KCl, then resuspended in 10 ml 0.6 M KC1 + 50 

mM CaCl 2 . After counting in a haemocytometer the 

protoplasts were harvested by centrif ugat ion as before 

5 and resuspended to a final concentration of 5 x 10 8 

protoplasts/ml. To 50 fil of protoplast suspension, 5 fil 

of DNA (2-5 fig , purified using Qiagen columns) was 

added, then 12.5 fil of PEG solution [25% (w/v) PEG 6000, 

50 mM CaCl 2/ 10 mM Tris - HC1 pH 7.5] and the mixture was 

10 incubated on ice for 20 minutes. A further 0.5 ml of PEG 
solution was added and the mixture was incubated on ice 
for a further 5 minutes. A 1 ml aliquot of 0.6 M KC1 + 
50 mM CaCl 2 was added and the protoplasts were plated out 
in 50 fil, 200 fil t and 400 fil aliquots. For 

15 transformation to uridine prototrophy, protoplasts were 
plated out onto AMM + 0.6 M KCl plates without adding 
uridine or uracil supplements. Plates were incubated at 
37°C for 3-4 days when transf ormant s were picked. For 
transformation to hygromycin B resistance protoplasts 

20 were plated out onto AMM + 0.6 M KCl plates (15 ml) and 
incubated for 4 hours at 30°C. 30 ml of 1% peptone, 1% 
agar, 1 mg/ml hygromycin B was then used to overlay the 
plates, which were incubated for 3-4 days when 
transf ormants were picked. Transf ormant s from both 

25 methods were streaked out to single colonies on selective 
media (i.e., lacking uridine/uracil supplements or 
containing 1 fig /ml hygromycin B) twice before spore 
suspensions were. made. 
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Analysis of strains for lovastatin production. 

Two fermentation methods were used for the analysis 
of lovastatin production. In Method A, 0 . 5 ml of spore 
suspension (10 8 c . f . u . /ml ) was inoculated into 25 ml of 
5 SEED medium in 250 ml unbaffled flasks and grown for 18 

hours at 250 rpm and 30°C (New Brunswick Scientific Model 
25 incubator/shaker) . A 1 ml portion of the resulting 
seed culture was used to inoculate 25 ml of FM in a 250 
ml unbaffled flask and grown for 6 days in the conditions 

10 described above. Fermentation Method B involved 

inoculating 50 ml of RPM in a 250 ml unbaffled flask with 
0.5 ml of spore suspension (10 8 c.f .u./ml) and growing at 
30°C and 250 rpm for 7 days in a New Brunswick Scientific 
Series 25 Incubator Shaker. 

15 SEED medium: 



20 



0.5% (w/v) Sigma corn steep liquor 
4% (w/v) tomato paste 
1% (w/v) oat flour 
1% (w/v) glucose 

1% (v/v) Vogel ' s trace elements 
distilled water to 1 1 



FM: 



25 



4.5% (w/v) glucose 

2.4% (w/v) Sigma peptonized milk 
0.25% (w/v) Difco Bacto yeast extract 
0.25% (w/v) polyethylene glycol 2000 
distilled water up to 1 1 



RPM: 



30 



35 



4% (w/v) lactose 

0.3% (w/v) rapeseed meal 

0.2% (w/v) KN0 3 

0.3% (w/v) KH 2 PO« 

0.05% (w/v) MgS0 4 *7H 2 O 

0.05% (w/v) NaCl 

0.05% (v/v) Sigma antifoam B 

0.05% (v/v) trace elements solution 

pH to 6.5 and made up to 1 1 with distilled water. 
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10 



15 



20 



25 



Trace elements solution is: 

0.16% (w/v) MnS0 4 
0.34% (w/v) ZnS0 4 -7H 2 0 
0.2% (w/v) CoCl 2 -6H 2 0 
0.5% (w/v) FeS0 4 • 7H 2 0 

made up to 1 liter with distilled water. 

The cultures were extracted by adjusting the pH of 

the media to 3 with HCl , adding an equal volume of ethyl 

acetate, and shaking the mixture on a New Brunswick 

Scientific Series 25 incubator/shaker at 250 rpm for 2 

hours. For analysis, 1 ml of the ethyl acetate layer was 

dried under a nitrogen stream and resuspended in 0.1 ml 

of methanol. For TLC analysis 10 fil of this extract was 

run on C-18 reverse phase TLC plates (RP-18 F 254 - Merck) 

in a solvent system of methanol : 0 . 1% phosphoric acid 

(9:1) . TLC plates were developed by spraying with 10% 
phosphomolybdic acid in methanol and heating with a heat 
gun. Extracts were compared with authentic lovastatin, 
monacolin J, monacolin L, and dihydromonacol in L (acid 
and lactone forms) . For HPLC analysis a Waters Nova-Pak 
C l8 (3.9 x 150 mm) column was used with a solvent system 
of acetonitrile (B) and 0.1% phosphoric acid (A). The 
column was eluted with a preprogrammed gradient of 0 to 
100% B into A over 25 minutes using gradient 7 (Waters 
Millenium Software) with a flow rate of 1.5 ml/min and 
metabolites were detected with a Waters 996 Photodiode 
Array Detector; lovastatin was detected at 238 nm. For 
purification of metabolites a Waters Prep Nova-Pak HR C 1B 

(7.8 x 300 mm) column was used. The same solvent system 
as above was used with gradient of 0 to 100% B in A over 
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75 minutes at: a flow rate of 4.5 ml/min. Fractions were 
collected manually, back extracted with ethyl acetate and 
dried. For HPLC-MS an Aquapore OD-300 7 micron (1.0 x 
100 mm) column was used with a gradient of 0 to 100% 
acetonitrile into A (0.05% TFA) over 30 minutes at a flow 
rate of 0.02 ml/min. 
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CLAIMS 

We claim: 

1. A method of increasing the production of 
lovastatin in a lovas tat in-producing organism, comprising 
the steps of transforming the organism with the D4B 
segment, wherein the segment is transcribed and 
translated, and wherein an increase in lovastatin 
production occurs. 

2 . The method of claim 1 wherein the D4B segment 
is the A. terreus D4B segment. 



3. The method of claim 1, wherein the D4B segment 
is identical to nucleotides 579 - 33,000 of SEQ ID NO: 18 
and 1 - 5,34 9 of SEQ ID NO : 1 9 . 

4. The method of claim 1, wherein the lovastatin- 
producing organism is selected from the group consisting 
of A . terreus ATCC 20542 and ATCC 20541. 

5. The method of claim 1, wherein the organism is 
selected from the group consisting of fungi and yeast. 

6. The method of claim 1 wherein the increase is 
at least 2-fold. 
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The method of claim I wherein the nucleic acid 



7 . 



sequence 



is identical to a sequence isolated from ATCC 



98876 . 



8 . 



The method of claim 1 additionally comprising 



transforming the organism with the entire A. terreus 



lovastatin gene cluster. 



9 . 



The method of claim 8 wherein the gene cluster 



comprises SEQ ID NOs:18 and 19. 

10. The method of claim 8 wherein the nucleic acid 
sequence of the gene cluster is identical to sequences 
isolated from ATCC 98876 and 98877. 

11. A method of increasing the production of 
monacolin J in a lovastat in-producing organism, 
comprising the steps of transforming the organism with 
the D4B segment, wherein the segment is translated, and 

5 wherein an increase monacolin J production occurs. 

12 . A method of increasing the production of 
lovastatin in a lovastat in-producing organism, comprising 
the step of transforming the organism with the LovE gene, 
wherein the nucleic acid sequence is translated, and 

5 wherein an increase in lovastatin production occurs. \ 
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13. The method of claim 12 wherein the increase is 

at least 2.0-fold. 



14 . The method of claim 13 wherein the increase is 
at least 5-fold. 



15. The method of claim 12 wherein the nucleotide 
sequence of the LovE gene comprises SEQ ID NO: 27. 

16 . A method of increasing the production of 
lovastatin in a lovastat in-producing organism comprising 
the steps of transforming the organism with a nucleic 
acid sequence comprising a truncated version of the A . 

5 terreus D4B segment, wherein the nucleic acid sequence is 
transcribed and translated and wherein an increase in 
lovastatin production occurs. 



17. A method of increasing the production of 
lovastatin in a lovastat in-producing organism comprising 
the steps of transforming the organism with a nucleic 
acid sequence comprising a truncated version of the A. 
5 terreus lovastat in-producing gene cluster, wherein the 
nucleic acid sequence is transcribed and translated and 
wherein an increase in lovastatin production occurs. 
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18. A method of increasing or conferring the 
production of monacolin J in a non- lovastat in-producing 
organism comprising the steps of transforming the 
organism with a nucleic acid sequence comprising the D4B 

5 segment, wherein the nucleic acid sequence is transcribed 
and translated and wherein an increase in monacolin J 
production occurs. 

19. The method of claim 18 wherein the D4B segment 
is the A. terreus D4B segment. 

20. The method of claim 18 wherein the D4B segment, 
comprises nucleotides 579 - 33,000 of SEQ ID NO: 18 and 1- 
5,349 of SEQ ID NO : 1 9 . 

21. The method of claim 18 additionally comprising 
the step of converting the monacolin J into lovastatin. 

22. The method of claim 18 additionally comprising 
the step of transforming the organism with a nucleic acid 
sequence comprising the LovF gene, wherein the nucleic 
acid sequence is transcribed and translated and wherein 

5 an increase in lovastatin production occurs. 

23. An isolated nucleic acid sequence selected from 
the group consisting of SEQ ID NOs:20 - 36. 
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24 . A lovastat in-producing organism, wherein the 
organism has been genetically modified to have increased 
lovastatin production, wherein the increase is at least 
2-fold. 

25. The organism of claim 24, wherein the organism 
is a yeast or a fungi. 

26. A non-lovastatin producing organism, wherein 
the organism has been genetically modified to produce 
monacolin J. 

27. The organism of claim 26, wherein the organism 
is a yeast or a fungi. 

28. A non-lovastatin producing organism, wherein 
the organism has been genetically modified to produce 
lovastatin . 

29. The organism of claim 28 wherein the organism 
is a yeast or a fungi. 
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<140> 
<141> 

<160> 36 

<170> Patentln Ver . 2.0 

<210> 1 

<211> 1529 

<212> PRT 

<213> Aspergillus terreus 

<4G0> 1 

Met Ala Ser Leu Leu Phe Phe Thr Val Phe Asn Leu Thr Leu Ala Leu 
1 5 10' } 5 

Leu Ser Ser Thr Ala Thr Gly Ala Ala Val Pro Val Ser Arg Pre Thr 
20 25 30 

Asp Asp Ser Arg Tyr lie Asp Phe Asp Ala Ala Glu Trp Arg Pro Arg 
35 40 45 

Ala Lys Arg Asp Asp Ala Leu Lys Val Pro Leu Arg lie Leu Pro Leu 
50 55 60 

Gly Ala Ser lie Thr Tro Gly Tyr Leu Ser Ser Thr Gly Asn G1-- Tyr 
65 70 75 ~ 60 

Arg Lys Pre Leu Arg Asp Lvs Leu Arg Phe Glu Gly Trp Glu Val Asp 
85 90 95 

Met Val Gly Lys Ala His Ser Gly Asp Val He Thr Gin Val Gin Thr 
100 105 110 

Ala Ala Ala Asn Ser Leu Ala Tyr Lys Pro Asn Val Val Leu lie Asn 
115 120 125 

Ala Gly Tnr Asn Asp Cys Asp Tyr Asn Vai Asp Pro Ala Asn Ala Gly 
130 135 140 

Glu Arg Met Arg Ser Leu He Glu Thr Leu He Gly Ala Pro Asp Met 
145 150 155 160 

Ala Asn Thr Leu He Val Leu Ser Thr Leu He Pro Ser Gly Ser Thr 
165 170 175 

Thr Leu Glu Ala Asn Arg Pro Ser Val Asn Ala Gin Phe Arg Glu Leu 
180 185 190 

Val Leu Asp Met Arg Glu Ala Gin Asn Val Ser He Val Leu Ala Asp 
195 200 205 

Met Asp Pre- Pro Ala Pro Ser Pre Glv Asn Asn Trp He Thr Tyr Pro 
210 215 " 220 



1 



1NSDOCID: <WO 0037629A2 I > 



WO 00/37629 PCT/US 99/29583 

Asd Asn Phe Ala Asc Asn Lys His Pro Asn Asd Tyr Gly Tyr Se- Glr 
225 230 235 "* 240 

Met Ala Asp lie Trp Tyr Asn Ala He Tyr Asn Ala Ala Val Ala Glu 
245 25C 255 

"*.y Thr Cvs 

260 265 270 

Asp Lys Glu Tyr Gly Ser Gly Val Tyr Ala Giv Gly Phe Thr Gin Gin 
275 280 " 285 

Gly Ser Gly Glu Asp Asp Gly He Tyr Arg His Asp Ser Glu Tyr Ser 
290 295 300 

.Gly Ala Leu Phe Thr Val Arg Ala Gly Lys Gly Ala Ala Asp Pro Tyr 
305 310 315 320 

Lys Asp Asp .Asp Glu Leu His Phe Phe Phe Gly Arg Leu Tyr THr Arg 
325 330 335 

Ala Tyr Asp Asp Met Met He Phe His Lys Asp Lys Asp Ser Gly Ala 
340 345 350 

Val Thr Phe Val Ser Tyr Thr Asn Asn Val His Thr Glu Glu Gin Glu 
355 360 365 

Phe Thr Lys Gly Gly Thr Phe Ser Thr His Asn Asn Cys Asn Pro Gly 
370 375 380 

Gly Val His Phe He Asp He Asn Gly Asp Gly Leu Asp Asp Tyr He 
385 390 395 400 

Cys He Ala Leu Asp Gly Thr Thr Tyr Ala Ser He Asn Asn Gly Asp 
405 410 415 

Gly Asp Ala Lys Ser Asn Lys Pro Pro Ser Phe Thr Asp He Gly Leu 
420 425 430 

Trp Lys Ser Pro Glu Gly Tyr Asp Gin Ala His Val Arg Leu Ala Asp 
435 440 445 

He Asp Gly Asp Gly Arg Ala Asp Tyr Cys Gly Leu Ala Asp Asn Gly 
450 455 460 

Asp Val Thr Cys Trp Arg Asn Gly Trp He Glu Asp lie Pro Ala Tyr 
465 470 475 480 

Trp Gin Pro Leu Gly Lys Arg Phe Thr Gly Lys Val Met Gly Asd Leu 
4 85 4 90 ' 4 95 

Arg Gly Val Arg Phe Glu Asp He Asn Gly Asp Gly Arg Asp Asd Trp 
500 505 510 

Met Trp Val Asp Asp Asp Gly Ala Thr Thr Thr Tyr Thr Asn Ser Arg 
515 520 525 

Ser Cys He Lys Gly Glu Ser Gly Asp Gly Leu Asn Val Val Trp Arq 
530 535 540 

Gin Gly Phe Tyr Gin Asp Ala Asn Ser Gly Pro Ser His Pro Gly Met 
545 550 555 560 

Gly Val lie Phe Gly Thr Ser Gly Leu Ara Asp Gin Val Tyr Phe Ala 
565 570 575 
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Arg Leu Tyr Gly Glu Val Ala Asp Phe Gly Giu Leu Gly Ara Gin Asp 
580 585 590 

Tyr Val Phe lie Lvs Lys Asp Thr Ser Asd Lys Tyr Phe Gly Pro Leu 
595 600 ' 605 

Tyr Tyr Val His Val Trp Lys Ser Lys Gly Ala Gly Gly Ala Lys lie 
610 615 620 

Lys Ala Asp Gly Asp Arg Tyr Cys Asn Met Met Gly His Asp Asn Gly 
625 630 635 640 

Met Met Asp Tvr lie Trp lie His Ser Thr Gly His Met Arg Leu Tyr 
645 650 655 

Pro Asn Arg Gly Leu Val Glu Val Pro Ala Asp Gly Ser Ser Phe Trp 
660 665 670 

Gly Ala Asn Glu He lie Phe Asp Pro-Gin Glu Gin He Gly Met Lys 
675 680 685 

Leu Asp Arg Arg Asp Leu His Leu Ala Asp Trp Asp Gly Asp Gly Ala 
690 695 700 

Cys Asp He lie Trp Thr Asp Pro Asp Asn Leu Asn Arg Ala Gin Val 
705 710 715 720 

Trp Arg Asn Lys He Lys Asp Thr Gly Ser Phe Asp Trp Asp Tyr Asn 
725 730 735 

lie Asn Ala Ala Asp Glu Leu Tyr Cys Pro Glu His Arg Gly Leu Gly 
740 745 750 

Phe Phe Asp Arg Pro Val His Phe Ala Asp Val Ser Gly Asn Gly Lys 
755 760 765 

Ala Asp Tyr Leu Cys Val Glu Lys Asp Gly Arg Thr Trp Gly Trp Val 
770 775 780 

Asn Gly Asp Asp Gly Trp Asp Tyr lie Asp Gin Phe Lys Tyr Ser Glu 
785 790 795 800 

Glu Lys Asp Arg Ala Asn Leu His Trp Ala Asp Val Asn Gly Asp Gly 
805 810 815 

Lys Ala Asp Met lie Trp Thr Asp Lys Phe Ser Gly Asp Gly Ser Val 
820 825 830 

Trp Tyr Asn Leu Gly Gin Arg Asp lie Lys Gly Ser Arg Tyr Glu Trp 
835 840 845 

Gly Pro Gin Gly Pro Lys Tyr Arg Gly Ala Val Glu Gly Ser Cys Thr 
850 855 860 

Tyr Phe Pro Asp Leu Asn Gly Asp Gly Arg Ala Asp Met His Ser lie 
865 870 875 880 

Trp Asn Ser He Asn Asn Thr Ala Gin Thr Trp Tyr Asn Glu Cvs Ala 
885 890 895 

Thr Lys Asp His Thr Gly Asp Asp Gly Pro lie Thr Asn Pro Asn Leu 
900 905 910 

Pro Val Ser Pro Val Lys Ala Pro He Glu Leu Thr Pro His Tyr Gin 
915 920 925 
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Asp Asn Ser Giu Cys Thr Arg Aia Gin Vai Gir, Tr.r Leu Phe Giu G'u 

935 940 

Met Gin Tyr Ala Leu Asp Aia Ala Ser Giu Vai Ala Tyr Phe Ser Gly 
945 950 955 9 6 o 

Gly Ala Tyr Asp Pro Tyr Arg Asp lie Phe Phe Aia Giu Ser Leu Thr 
965 g70 975 

Asp Ser Leu Thr Phe Thr lie Asn Vai Arg Tyr Thr Phe Asd Arg Met 
560 985 990 

vai Thr Met iie Ser Gly Ser Ser Gin Phe Asp Asp Giu Lys Phe Thr 
S95 1000 100 5 

Ile in'i , n CyS LyS Asn LSU Arg Gly Cys As P Glu Asn Gly Trp Leu Ala 

1015 1020 

Met Met Asn Asn Arg Asn Arg Leu Asn Phe Cys Pro Lys Phe Phe Thr 
1025 . 1030 1035 1040 

Asp Glu Leu Lys Ser Ser Arg Arg Thr Arg Asp Tyr Vai Tyr Glv ^Tro 
1045 10 50 10 55 

Lys Gly Aia^rg Asp Leu Ala Ala Gly Thr Phe Asn Arg His Cys lie 

1065 1070 

Glu Arg Gly Arg Lys Ala Glu Arg Ala Ala Asn Glu Leu Arg lie Ala 
1075 1080 10 85 

Gly inon TrP Gln n n^ Arg LeU Leu C ^ s Pro As P p ro Asn Asn 

iuyu 1095 1100 

Leu Gly Gin Giu Gly lie Cys Asp Ser Lys Leu Ser Ala Tyr Asn Ala 
Llub lli0 1115 H20 

Asp Ser Trp Aia Leu Vai Vai Leu Gly Gly Tyr Tyr Thr Lys lie Cys 
1125 H30 J 1135 y 

Gly Arg Gin lie Pro Leu Pro Glu Glu Ser Ala Ser Ser Ala Asp Asp 
1140 1145 i 150 F P 

Ser Ser Cys Pro Ala Tyr Asp Asp Ser Ser Tyr Asp Ala Asp Thr Vai 
11-'-' 1160 1165 

Tyr Gly Vai Asn Asp Tyr Vai His Phe Gly Asp Ser Tyr Aia Ala Gly 
L±/l} 1175 1180 

Met Gly Thr Gly Thr Thr Thr Giy Asp Ser Cys Arg Vai Gly Ser Asn 
1185 il9 ° H95 1200 

Ser Tyr Gly Lys Leu Vai Gin Glu Trp Phe Asp Thr Glu Asp Phe Thr 
1205 1210 1215 



Tyr Thr Asn Tyr Ala Cys Ser Gly Asp Thr Thr Vai Gly Leu Asn Ly- 
1220 1225 1230 

Lys He Asp Gin Trp Leu Gly Gin Asp Fro Thr Gly Thr Thr Met Ala 

1240 124 5 

Thr Leu Thr lie Giy Giy Asn Asp Vai Phe Phe Ser Asp Leu Vai Ser 

12;5 X260 

Asr^Cys Vai Leu Thr Met Trp Trp Tyr Ser Leu Glu Gin Tyr Arg Gin 

^ 1275 128O 
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Trp Cys Leu GIu Thr Glu GIu Lys Ala Arc Asn Leu Met Gin Asp Thr 
1265 12 9 C 12 9 5 

GIv Ser Asp Gly Leu GIv Ser Lvs Leu Arc Ala Aia T vr GIu Lvs lie 
1300 " 1305 " " 1310 

Leu Asp Ara Ser GIv Ser Ser Val Tyr Leu Pro Val lie Leu lie Tyr 
1315 1320 1325 

Ser Cys Arg Ala Val Leu Ara Ara Ala Aso Phe Thr Leu Val Vai Gin 
1330 1335 1340 

Pro Leu Arg Pro Trp Leu Cys His Leu Leu Gin Ara Arg His His Ara 
1345 1350 1355 " 1360 

Leu Arg Leu Asn His Leu Leu Glu Leu Asn Asp Leu Val Arg Met Leu 
1365 1370 1375 

Asn Ser Leu lie Gin Ser Thr lie Ser. Asp lie Asn Thr Ala Ara Asn 
1380 1385 1390 

Thr Glu Gin lie His Tyr lie Asp Met Asp Ala Arg Phe Asp GIv His 
1395 1400 1405 

Arg Trp Cys Glu Pro Gly Thr Gin Glu Pro Asp Pro Asp Asn Pro Asn 
1410 1415 1420 

Thr Tyr Phe Phe Leu Ser Ala Trp Pro Asp He Ala He Val Giv Asp 
1425 1430 1435 ~1440 

Thr Thr Ala Glu Ser Thr Asn Ala Thr Glu Thr Asp Glu He Thr Ala 
1445 1450 1455 

Leu Met Asn Ser Gly Ser He Gin Leu Pro Asp Ala Asp Thr Cys Gin 
1460 1465 1470 

Asp Ala Leu Gly Ser Asp Pro Asp Pro Tyr Ala Val Phe Met Cys Asp 
1475 1480 1485 

Val Ala Vai His Val Lys Ala Asn Ser Ser Ser Leu He Ala Gin Ser 
1490 1495 1500 

Leu Asp Arg Ala Asn Gin Ala He Ala Asn Arg Asp Tyr Ser Ser Gin 
1505 1510 1515 1520 

Asp Val Ser Trp Trp Leu Pro Ser Pro 
1525 

<210> 2 
<211> 328 
<212> PRT 

<213> Aspergillus terreus 
<400> 2 

Met Thr Leu Pro Thr Leu Pro Asn Trp He Arg Met Cys Val His Leu 
15 10 15 

Ser Leu Thr His Leu His Gin His Arg Ser Pro Lys Tyr Glu Ser He 
20 25 30 

Pro He Lys Ser lie Gin Ala Asn Ser His Arg lie Leu lie lie Leu 
35 40 45 

Thr Thr Ala Ser Phe Tyr Pro Gin lie Arg Cys lie Gin Leu Ara Asn 
50 55 60 
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Ser Thr His Giy lie Ser Thr Ala Tyr lie Leu Phe Asn Leu lie Ser 
65 70 75 80 

Ala Thr Glu His Phe Thr He Leu Phe Ala Leu Leu Val Asn Se- Gi\' 
65 90 95 

Gly Asp Val Leu lie His Glu Pro Pro Thr Thr Giy Aso Giy Leu A*n 
100 105 " HO 

Leu Tyr Gl.n Leu Phe Ala Val Trp Met Gly Cys Leu Val Leu Phe Cys 
115 120 125 

Gin Ala He His Ser Leu His Ala Asn Pro Ara Arg Lvs Leu lie Leu 
130 135 " 140 

Leu Thr lie Tyr He Gin Tyr Leu Cys lie Ser He Leu Pro Glu Val 
145 150 155 160 

He Asp Ala lie' Thr Thr Pro Glu Glu Thr Ara Lys Gin Ara Pro Pro 
165 170 " " 175 

Thr Gly Glu Arg Asn Trp Leu He Gly Leu Phe Leu Ser Ala His Ala 
180 185 190 

Met Thr Val Leu Pro Leu Ser Ala Val Leu Arg He Ala Gly Phe lie 
195 200 205 

Asp Gin Ser Arg Leu He Ser Arg Arg Arg Arg Glu Gin Pro Ser Val 
210 215 220 

Leu Ser Leu Thr Gly Leu Ala Cys Gin Ala Val Val Phe Ala Leu Val 
225 230 235 240 

Ser Gly Leu Trp Val Leu Arg Val Gin Gin Pro Val Pro Arg Met Pro 
245 250 255 

Met Arg Arg Pro Val Asp Trp Met Tyr Trp Tyr His Val He Gly Trp 
260 265 270 

Pro Val Val Asp Asp Ala Val Tyr Ala Leu Gly Gin Trp Val Leu Phe 
275 280 285 

Trp Tyr Ala Val Cys Trp Arg Ser Arg Gly Asp Ala Arg Asp Glu Ala 
290 295 300 

Val His Ala Gly Glu Thr Asp Asp Leu Leu Gly Glu Asp Glu Glv His 
305 310 315 " 320 

Gly Tyr Gly Gly Thr Gly Thr Ser 
325 

<210> 3 
<211> 323 
<212> PRT 

<213> Aspergillus terreus 
<400> 3 

Met Val Gly Ser Lys Leu Ala His Asn Glu Glu Trp Leu Asp He Ala 
15 10 IS 

Lys His His Ala Val Thr Met Ala lie Gin Ala Arg Gin Leu Arq Leu 
20 25 30 

Trp Pro Val lie Leu Arg Pro Leu Vai His Trc Leu Glu Pro Gin Gly 
35 40 * 45 



6 



NJSDOCID: <WO 0037629A2 f > 



VVO 00/37629 PCT7US99/29583 

Ala Lys Leu Ara Ala Gin Val Ara Ara Ala Ara Gin Leu Leu Asp Pro 
SC ' 55 ' 6G 

He He Gin Glu Arg Arg Ala Glu Arc Asc Ala Cvs Ara Ala Lys Gly 
65 70 75 - " ~ 80 

He Glu Pro Pro Arg Tyr Val Asp Ser He Gin Trp Phe Glu Asp Thr 
85 90 95 

Ala Lys Gly Lys Trp Tyr Asp Ala Ala Gly Ala Gin Leu Ala Met Asp 
100 105 110 

Phe Ala Gly He Tyr Gly Thr Ser Asp Leu Leu He Gly Glv Leu Val 
115 120 * 125 

Asp He Val Arg His Pro His Leu Leu Glu Pro Leu Arg Asp Glu lie 
130 135 140 

Arg Thr Val He Gly Gin Gly Gly Trp Thr Pro Ala Ser Leu Tyr Lys 
145 150 155 160 

Leu Lys Leu Leu Asp Ser Cys Leu Lys Glu Ser Gin Arg Val Lys Pro 
165 170 175 

Val Glu Cys Ala Thr Met Arg Ser Tyr Ala Leu Gin Asp Val Thr Phe 
180 185 190 

Ser Asn Gly Thr Phe He Pro Lys Gly Glu Leu Val Ala Val Ala Ala 
195 200 205 

Asp Arg Met Ser Asn Pro Glu Val Trp Pro Glu Pro Ala Lvs Tyr Asp 
210 215 220 

Pro Tyr Arg Tyr Met Arg Leu Arg. Glu Asp Pro Ala Lys Ala Phe Ser 
225 230 235 240 

Ala Gin Leu Glu Asn Thr Asn Gly Asp His He Gly Phe Gly Trp His 
245 250 255 

Pro Arg Ala Cys Pro Gly Arg Phe Phe Ala Ser Lys Glu He Lys Met 
260 265 270 

Met Leu Ala Tyr Leu Leu lie Arg Tyr Asp Trp Lys Val Val Pro Asp 
275 280 285 

Glu Pro Leu Gin Tyr Tyr Arg His Ser Phe Ser Val Arg He His Pro 
290 295 300 

Thr Thr Lys Leu Met Met Arg Arg Arg Asp Glu Asp He Arg Leu Pro 
305 310 315 320 

Gly Ser Leu 



<210> 4 
<211> 256 
<212> PRT 

<213> Aspergillus terreus 
<400> 4 

Met Arg Tyr Gin Ala Ser Pro Ala Leu Val Lys Ala Pro Arg Ala Leu 
15 10 15 

Leu Cys He His Gly Ala Gly Cys Ser Pro Ala He Phe Arg Val Gin 
20 25 30 
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Leu Ser Lys Leu Arg Ala Ala Leu Arc GIu Asn Phe Glu Phe Val Tyr 
35 40 45 

Val Thr Ala Pro Phe Pre Ser Ser Ala Gly Pro Gly He Leu Pro Val 
50 55 60 

Phe Ala Asp Leu Gly Pre Tyr Tyr Ser Trp Phe Glu Ser Ser Ser Asp 
65 70 75 80 

Asn Asn His Asn Gly Pro Ser Val Ser Glu Arg Leu Ala Ala Val His 
85 90 95 

Asp Pro He Arg Arg Thr He Val Asp Trp Gin Thr Gin His Pro His 
100 105 110 

He Pro He Val Gly Ala He Gly Phe Ser Glu Gly Ala Leu Val Thr 
115 12C 125 

Thr Leu Leu Leu Trp Gin Gin Gin Met Gly His Leu Pro Trp Leu Pro 
130 135 140 

Arg Met Ser Val Ala Leu Leu He Cys Pro Trp Tyr Gin Asp Glu Ala 
1^5 150 155 160 

Ser Gin Tyr Met Arg Asn Glu Val Met Lys Asn His Asp Asp Aso Asn 
165 170 175 

Asp Ser Lys Asp Thr Glu Trp Gin Glu Glu Leu Val He Arg He Pro 
180 185 190 

Thr Leu His Leu Gin Gly Arg Asp Asp Phe Ala Leu Ala Gly Ser Lys 
195 200 205 

Met Leu Val Ala Arg His Phe Ser Pro Arg Glu Ala Gin Val Leu Glu 
210 215 220 

Phe Ala Gly Gin His Gin Phe Pro Asn Arg Pro Arg Asp Val Leu Glu 
225 230 235 240 

Val He Asn Arg Phe Arg .Lys Leu Cys Val Thr Ala Gin Thr Leu Glu 
245 ^ 250 255 



<210> 5 
<211> 363 
<212> PRT 

<213> Aspergillus terreus 



Met Gly Asp Gin Pro Phe He Pro Pro Pro Gin Gin Thr Ala Leu Thr 
15 10 15 

Val Asn Asp His Asp Glu Val Thr Val Trp Asn Ala Ala Pro Cys Pro 
20 25 30 

Met Leu Pro Arg Asp Gin Val Tyr Val Arg Val Glu Ala Val Ala He 
35 40 45 

Asn Pro Ser Asp Thr Lys Met Arg Gly Gin Phe Ala Thr Pro Trp Ala 
50 55 60 

Phe Leu Gly Thr Asp Tyr Ala Gly Thr Val Val Ala Val Gly Ser Asp 
65 70 75 80 

Val Thr His lie Gin Val Gly Asp Arg Val Tyr Gly Ala Gin Asa Glu 



<400> 5 



85 



90 



95 



NSDOCID: <WO 0037629A2 I > 



WO 00/37629 PCT/US99/29583 

Met Cys Pro Arg Thr Pro Asp Gin Glv Ala ?he Ser Gin Tyr Thr Val 
100 105 110 

Thr Arg Gly Arg Val Trp Ala Lys He Pro Lys Gly Leu Ser Fhe Glu 
115 120 " 125 

Gin Ala Ala Ala Leu Pro Ala Gly He Ser Thr Ala Gly Leu Ala Met 
130 1j= 140 

Lys Leu Leu Gly Leu Pro Leu Pro Ser Pro Ser Ala Asp Gin Pro Pro 
145 150 155 160 

Thr His Ser Lys Pro Val Tyr Val Leu Val Tyr Gly Gly Ser Thr Ala 
165 170 175 

Thr Ala Thr Val Thr Met Gin Met Leu Arg Leu Ser Gly Tyr I'e Pro 
180 185 190 

He Ala Thr Cys Ser Pro His Asn Phe Asp Leu Ala Lys Ser Arg Gly 
195 200 205 

Ala Glu Glu Val Phe Asp Tyr Arg Ala Pro Asn Leu Ala Gin Thr He 
210 215 220 

Arg Thr Tyr Thr Lys Asn Asn Leu Arg Tyr Ala Leu Asp Cys He Thr 
225 230 235 240' 

Asn Val Glu Ser Thr Thr Phe Cys Phe Ala Ala He Gly Arg Ala Gly 
245 250 255 

Gly His Tyr Val Ser Leu Asn Pro Phe Pro Glu His Ala Ala Thr Arg 
260 265 270 

Lys Met Val Thr Thr Asp Trp Thr Leu Gly Pro Thr He Phe Gly Glu 
275 280 285 

Gly Ser Thr Trp Pro Ala Pro Tyr Gly Arg Pro Gly Ser Glu Glu Glu 
290 295 300 

Arg Gin Phe Gly Glu Asp Leu Trp Arg He Ala Gly Gin Leu Val Glu 
305 310 315 320 

Asp Gly Arg Leu Val His His Pro Leu Arg Val Val Gin Gly Gly Phe 
325 330 335 

Asp His He Lys Gin Gly Met Glu Leu Val Arg Lys Gly Glu Leu Ser 
340 345 350 

Gly Glu Lys Leu Val Val Arg Leu Glu Gly Pro 
355 360 



<210> 6 
<211> 413 
<212> PRT 

<213> Aspergillus terreus 
<400> 6 

Met Gly Ser He lie Asp Ala Ala Ala Ala Ala Asp Pro Val Val Leu 
15 10 15 

Met Glu Thr Ala Phe Arg Lys Ala Val Lys Ser Arg Gin He Pro Gly 
20 25 30 

Ala Val He Met Ala Arg Asp Cys Ser Gly Asn Leu Asn Tyr Thr Arg 
35 40 45 
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Cys Phe Giy Ala Arc Thr Val Arc Ara Asp Glu Cys Asn Giy Leu Pre 
50 ' 55 60 

Pro Leu Gin Val Asp Thr Pro Cys Arc Leu Ala Ser Ala Thr Lys Leu 
65 70 " " 75 80 

Leu Thr Thr lie Met Ala Leu Gin Cys Met Glu Arg Giy Leu Val Asp 
85 90 95 

Leu Asp Glu Thr Val Asp Arg Leu Leu Pro Asp Leu Ser Ala Met Pro 
100 105 110 

Val Leu Glu Giy Phe Asp Asp Ala Giy Asn Ala Aro Leu Ara Glu Arc 
115 120 " 125 

Arg Giy Lys lie Thr Leu Arg His Leu Leu Thr His Thr Ser Giy Leu 
130 135 140 

Ser Tyr Val Phe Leu His Pro Leu Leu. Arg Glu Tyr Met Ala Gin Giy 
145 150--*' 155 160 

His Leu Gin Ser Ala Glu Lys Phe Giy lie Gix Ser Arg Leu Ala Pro 
165 170 175 

Pro Ala Val Asn Asp Pro Giy Ala Glu Trp lie Tyr Giy Ala Asn Leu 
180 165 190 

Asp Trp Ala Giy Lys Leu Val Giu Arg Ala Thr Giy Leu Asp Leu Glu 
195 200 205 

Gin Tyr Leu Gin Glu Asn lie Cys Ala Pro Leu Giy lie Thr Asp Met 
210 215 220 

Thr Phe Lys Leu Gin Gin Arg Pro Asp Met Leu Ala Arg Arg Ala Asp 
225 230 235 240 

Gin Thr His Arg Asn Ser Ala Asp Giy Arg Leu Arg Tyr Asp Asp Ser 
245 250 255 

Val Tyr Phe Arg Ala Asp Giy Glu Glu Cys Phe Giy Giy Gin Giy Val 
260 265 * 270 

Phe Ser Giy Pro Giy Ser Tyr Met Lys Val Leu His Ser Leu Leu Lvs 
275 280 285 

Arg Asp Giy Leu Leu Leu Gin Pro Gin Thr Val Asp Leu Met Phe Gin 
290 295 300 

Pro Ala Leu Glu Pro Arg Leu Glu Glu Gin Met Asn Gin His Met Asp 
305 310 315 320 

Ala Ser Pro His lie Asn Tyr Giy Giy Pro Met Pro Met Val Leu Arg 
325 330 335 

Arg Ser Phe Giy Leu Giy Giy He He Ala Leu Glu Asd Leu Asp Giy 
340 345 * 350 

Glu Asn Trp Arg Arg Lys Giy Ser Leu Thr Phe Giy Giy Giy Pro Asn 
355 360 365 

He Val Trp Gin He Asp Pro Lys Ala Giy Leu Cys Thr Leu Ala Phe 
370 375 380 

Phe Gin Leu Glu Pro Trp Asn Asd Pro Val Cys Arg Asd Leu Thr Arg 
385 390 395 * 400 
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Thr Phe Glu His Ala lie Tyr Ala Gin Tvr Gin Gin Giy 
405 41C 



<210> 7 
<211> 1068 
<212> PRT 

<213> Aspergillus terreus 
<400> 7 

Met Asp Fro Vai Val Arg Lys Pro Asp Pro Gly Gly Val Gin His Arg 
i 5 10 15 

Val Thr Lys Ala Leu Arg Ala lie Val Gly His Ala Cys Arg -His Pro 
20 25 30 

He His Thr Leu Leu Val Thr Ala Leu Thr Ala Ala Thr Thr His Leu 
35 40 45 

His Val Leu Glu Gly Thr Tyr Gin Ala Thr His Arg Glu Ala Ser Ala 
50 55 60 

Trp Lys Trp Gin He Asp Asp Arg Pro Lys Val Pro Glu Asp Gly Gin 
65 70 75 80 

Ser Asp Phe His Trp Ala Leu Val Thr Leu Asp Leu Pro Gly Ala Ser 
85 90 95 

Val Asp Ala Ser He Pro Phe Leu Ser Asn Thr Leu Ser Gly Phe Leu 
100 105 110 

Gly Ala Glu Gin Thr Thr Pro Thr Pro Asp Ser Ser Pro Ser Pro Asp 
115 120 125 

His Ser Ala Leu Thr Phe Arg Val Pro Tyr Ser Gin Leu Asp Gly Phe 
130 135 l40 

Leu Gin Ala Val Glu He lie Pro Ser Glu Lys Glu Asp Asp Ser Trp 
1^5 150 155 160 

Arg Leu Arg Ser Pro Arg Glu Glu Gly Ser Pro Arg Ser Leu Gly His 
165 170 175 

Trp Leu Giy Ser Ser Trp Leu Ser Phe Leu His Arg Val His His Ala 
180 185 190 

Glu Thr Val Asp Leu Val He lie Gly Leu Ser Tyr Leu Ala Met Asn 
195 200 205 

Met Thr Val Val Ser Leu Phe Arg Val Met Arg His Leu Gly Ser Arg 
210 215 220 

Phe Trp Leu Ala Ala Ser Val Leu Leu Ser Gly Ala Phe Ala Phe Val 
225 230 235 240 

Leu Gly Leu Gly lie Thr Thr Thr Cys Asp Val Pro Val Asp Met Leu 
245 250 255 

Leu Leu Phe Glu Gly He Pro Tyr Leu Val Leu Thr Val Gly Phe Glu 
260 265 270 

Lys Pro lie Gin Leu Thr Arg Ala Val Leu Cys Val Ser Glu Glu Leu 
275 280 285 

Trp Gly Gly Gly. Gin Arg Gin Val Pro Asn Gly Ala Ser Ser Asp Asp 



290 



295 



300 
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Ser Arg Gin Asn Gin Leu lie Pro Asn lie lie Gin Leu Ala Val Asp 
305 310 315 320 

Arg Glu Giy Trp Tyr lie Val Arc Ser Tvr Leu Leu Glu lie Gly Ala 
325 330 335 

Leu Ala Leu Gly Ala Val Leu Arc Fro Lys Asp Ser Leu Gly His Phe 
340 345' 350 

Cys Phe Leu Ala Ala Trp Thr Leu Leu lie Asp Ala Val Leu Leu Phe 
355 360 365 

Thr Phe Tyr Ala Thr He Leu Cys Val Lys Leu Glu He Thr Arq He 
370 375 380 

Arg Ser Pro Gly Gly Leu Gly Gin Val Asn Ala Lys His Pro Ser Glv 
385 390 395 400 

He Phe Gly His Lys Val Lys Ser Thr. Asn He Thr Trp Trp Lys Leu 
^05 - 410 415 

Leu Thr Val Gly Gly Phe Val Leu Cys His Phe Leu Gin Leu Ser Pro 
420 425 430 

Phe Phe Tyr Arg Val Met Gly Glu Tyr Met Ala Asn Gly Thr Leu Pro 
435 440 445 

Pro Thr Ala Val Ser Pro Phe Lys Glu Ala Ala Asn Gly Leu Asn Glu 
450 455 460 

He Tyr Leu Thr Ala Arg Val Glu Gly Phe Glu Thr Arg Val Thr Val 
465 470 475 480 

Leu Pro Pro Leu Gin Tyr Val Leu Glu Ser Ala Gly Phe Asn He Ser 
485 490 495 

Ala Thr Lys Arg Ser Thr Phe Asp Gly Val Leu Asp Gly Leu Glu Ser 
500 505 510 

Pro Leu Gly Arg Leu Cys Leu Met Gly Ala Leu Val Val Ser Leu Val 
515 520 525 

Leu Asn Asn His Leu lie His Ala Ala Arg Trp His Ala Trp Pro Gin 
530 535 540 

Ala Arg Glu Ser Ala Val Pro Asp Gly Ser Tyr Leu Ser Val Pro Cvs 
54J > 550 555 560 

Ser Ala Thr Ala Pro Glu Val Cys Thr Arg Pro Pro Glu Glu Thr Glu 
565 570 575 

Ala Leu Leu Lys Ser Asn Gin Ala Glu Ser Leu Thr Asp Asp Glu Leu 
580 585 590 

Val Glu Leu Cys Leu Arg Gly Lys He Ala Gly Tyr Ser Leu Glu Lys 
595 600 . 605 

Thr Leu Glu Arg He Ala Ala Gly Ser Ser Arg Ser Val Thr Arq Leu 
610 615 620 

Glu Ala Phe Thr Arg Ala Val Arg He Arg Arq Ala Ala Val Ser Lvs 
625 630 635 640 

Thr Pro Ser Thr Gin Asn Leu Cys Ser Gly Leu Ala Glu Ser Leu Leu 
645 650 655 
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Pro Tvr Arg Asd Tyr Asn Tvr Giu Leu Val His Gly Ala Cvs Cvs Giu 
660 665 670 

Asn Val Val Gly Tyr Leu Pro Leu Pro Leu Gly Val Ala Gly Pro Met 
675 680 685 

Val lie Asp Gly Gin Ala Leu Phe He Pro Met Ala Thr Thr GIu Gly 
690 695 700 

Vai Leu Val Ala Ser Ala Ser Arg Gly Cys Lys Ala He Asn Ala Glv 
~ - ; 5 710 715 72 C 

31 y Glv Ala Thr Thr Met Leu Lys Glv Aso Glv Met Thr Arg Glv Pre 
725 " 730 " " 735 

Cys Leu Arg Phe Pro Ser Ala Gin Arg Ala Ala GIu Ala Gin Arg Trp 
740 745 750 

Val GIu Ser Pro Leu Gly His GIu Val Leu Ala Ala Ala Phe Asn Ala 
755 760 765 

Thr Ser Arg Phe Ala Arg Leu Gin Thr Leu Thr Val Ala Gin Ala Gly 
770 775 780 

He Tvr Leu Tyr lie Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly 
755 790 795 800 

Met Asn Met He Ser Lys Gly Val GIu Lys Ala Leu GIu Ala Met Ala 
805 810 815 

Ala GIu Gly Gly Phe Pro Asp Met His Thr Val Thr Leu Ser Gly Asn 
820 825 830 

Phe Cys Ser Asp Lys Lvs Ser Ala Ala He Asn Trp lie Gly Gly Arg 
835 84C * 845 

Glv Lvs Ser Val He Ala Giu Ala Thr He Pro Ala GIu Thr Vai Ara 
850 855 860 

Gin Val Leu Lys Thr Asp Val Asp Ala Leu Val GIu Leu Asn Thr Ala 
865 870 875 880 

Lys Asn Leu Val Gly Ser Ala Met Ala Gly Ser Leu Gly Gly Phe Asn 
885 890 895 

Ala His Ala Ser Asn Leu Val Gin Ala Val Phe Leu Ala Thr Gly Gin 
900 905 910 

Asc Pro Ala Gin Asn Val GIu Ser Ser Ser Cys He Thr Thr Met Lys 
915 920 925 

Asn He Asp Gly Asn Leu His He Ala Val Ser Met Pro Ser Met GIu 
930 935 940 

Val Gly Thr He Gly Gly Gly Thr He Leu GIu Ala Gin Gly Ala Met 
945 950 955- 960 

Leu Asp Leu Leu Gly Val Arg Glv Ala His Ser Thr GIu Pro Gly Ala 
965 " 970 975 

Asn Ala Arg Ara Leu Ala Arg He Vai Ala Ala Ala Val Leu Ala Glv 
980 985 990 

Giu Leu Ser Thr Cys Ala Ala Leu Ala Ala Gly His Leu Val Asn Ala 
995 1000 1005 
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His Met Gin His Asa Ara Thr Ser Lys Aso Ala lie Ser Gly Thr Giu 
1010 1015 " 1020 

Tyr Gly Ala lie Arg Thr Pro Val -Tyr Val Vai lie Leu Giu~His Ala 
1025 1030 ■ 1035 1040 

Gly Asp lie His Phe Vai Gin lie Glu Tyr Lys Asn Thr Tyr Leu Ara 
1045 1050 1055 

Arg Lys Val Pro Thr Leu Ser Cys Asn Leu Giy Arg 
1060 1065 

<210> 8 
<211> 503 
<212> PRT 

<213> Aspergillus terreus 

<400> 8 ■■ ■ 

Met Ala Ala Asp Gin Gly lie Phe Thr Asn Ser Val Thr Leu Ser Pro 
15 10 15 

Val Glu Gly Ser Arg Thr Gly Gly Thr Leu Pro Arg Arg Ala Phe Arg 
20 25 30 

Arg Ser Cys Asp Arg Cys His Ala Gin. Lys lie Lys Cys Thr Gly Asn 
35 40 45 

Lys Glu Val Thr Gly Arg Ala Pro Cys Gin Arg Cys Gin Gin Ala Gly 
50 55 60 

Leu Arg Cys Val Tyr Ser Glu Arg Cys Pro Lys Arg Lys Leu Arg Gin 
65 70 75 80 

Ser Arg Ala Ala Asp Leu Val Ser Ala Asp Pro Asp Pro Cys Leu His 
85 90 95 

Met Ser Ser Pro Pro Val Pro Ser Gin Ser Leu Pro Leu Asp Val Ser 
100 105 110 

Glu Ser His Ser Ser Asn Thr Ser Arg Gin Phe Leu Asp Pro Pro Asp 
115 120 125 

Ser Tyr Asp Trp Ser Trp Thr Ser He Gly Thr Asp Glu Ala He Asp 
130 135 140 

Thr Asp Cys Trp Gly Leu Ser Gin Cys Asp Gly Gly Phe Ser Cys Gin 
1^5 150 155 160 

Leu Glu Pro Thr Leu Pro Asp Leu Pro Ser Pro Phe Glu Ser Thr Val 
165 170 175 

Glu Lys Ala Pro Leu Pro Prp Val Ser Ser Asp He Ala Arg Ala Ala 
180 185 190 

Ser Ala Gin Arg Glu Leu Phe Asp Asp Leu Ser Ala Val Ser Gin Glu 
195 200 205 

Leu Glu Glu He Leu Leu Ala Val Thr Val Glu Trp Pro Lys Gin Glu 
210 215 220 

lie Trp Thr Arg Ala Ser Pro His Ser Pro Thr Ala Ser Arg Glu Arg 
225 230 235 240 

He Ala Gin Arg Arg Gin Asn Val Trp Ala Asn Trp Leu Thr Asp Leu 
245 250 255 
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Has Met Fhe Ser Leu Asp Pro lie Gly Met Fhe Phe Asn Ala Ser Arq 
260 265 270 

Arg Leu Leu Thr Val Leu Arg Gin Gin Ala Gin Ala Asp Cys His Gin 
2/5 280 285 

Gly Thr Leu Asp Glu Cys Leu Ara Thr Lvs Asn Leu Phe To- Ala Va 1 
290 295 " 300 

His Cys Tyr lie Leu Asn. Val Arg lie Leu .Thr Ala He Ser Glu Leu 
305 310 315 320 

Leu Leu Ser Gin He Arg Arg Thr Gin Asn Ser His Met Se^ Pro Leu 
325 330 335 

Glu Gly Ser Arg Ser Gin Ser Pro Ser Arg Asp Asp Thr Ser Ser Ser 
340 345 350 

Ser Gly His Ser Ser Val Asp Thr He Pro Phe Phe Ser Glu Asn Leu 
355 360 365 

Pro He Gly Glu Leu Phe Ser Tyr Val Asp Pro Leu Thr His Ala Leu 
370 375 380 

Phe Ser Ala Cys Thr Thr Leu His Val Gly Val Gin Leu Leu Arg Glu 
385 390 395 400 

Asn Glu He Thr Leu Gly Val His Ser Ala Gin Gly He Ala Ala Ser 
405 410 415 

He Ser Met Ser Gly Glu Pro Gly Glu Asp He Ala Arg Thr Gly Ala 
420 425 430 

Thr Asn Ser Ala Arg Cys Glu Glu Gin Pro Thr Thr Pre Ala Ala Arg 
435 440 445 

Val Leu Phe Met Phe Leu Ser Asp Glu Gly Ala Phe Gin Glu Ala Lys 
450 455 460 

Ser Ala Gly Ser Arg Gly Arg Thr He Ala Ala Leu Arg Arg Cys Tyr 
465 470 475 480 * 

Glu Asp lie Phe Ser Leu Ala Arg Lys His Lys His Gly Met Leu Arg 
485 490 ' 495 

Asp Leu Asn Asn lie Pro Pro 
500 

<210> 9 
<211> 542 
<212> PRT 

<213> Aspergillus terreus 
<400> 9 

Met Thr Ser His His Gly Glu Thr Glu Lys Pro Gin Ser Asn Thr Ala 
1 5 10 15 

Gin Met Gin He Asn His Val Thr Gly Leu Arg Leu Gly Leu Val Val 
20 25 30 

Val Ser Val Thr Leu Val Ala Phe Leu Met Leu Leu Asp Met Ser He 
35 40 45 

He Val Thr Ala He Pro His He Thr Ala Gin Phe His Ser Leu Gly 
50 55 60 
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Asp Val Gly Trp Tyr Gly Ser Ala Tyr Leu Leu Ser Ser Cys Ala Leu 
65 70 75 80 

Gin Pro Leu Ala Gly Lys Leu Tyr Thr Leu Leu Thr Leu Lys Tyr Thr 
85 90 95 

Phe Leu Ala Phe Leu Gly Leu Phe Glu He Glv Ser Val Leu Cvs Gly 
100 105 110 

Thr Ala Arg Ser Ser Thr Met Leu He Val Gly Arg Ala Val A.La Gly 
115 120 125 

Met Gly Gly Ser Gly Leu Thr Asn Gly Ala He Thr He Leu Ser Ala 
130 135 140 

Ala Ala Pro Lys Gin Gin Gin Pro Leu Leu He Gly He Met Met Gly 
145 150 155 160 

Leu Ser Gin lie Ala He Val Cys Gly. Pro Leu Leu Gly Gly Ala Phe 
165 170 175 

Thr Gin Kis Ala Ser Trp Arg Trp Cys Phe Tyr He Asn Leu Pre lie 
180 185 190 

Gly Ala Phe Ala Thr Phe Leu Leu Leu Val lie Gin He Pro A<*n Arg 
195 200 205 

Leu Pro Ser Thr Ser Asp Ser Thr Thr Asp Gly Thr Asn Pro Lys Arq 
210 215 220 

Arg Gly Ala Arg Asp Val Leu Thr Gin Leu Asp Phe Leu Gly Phe Val 
225 230 235 240 

Leu Phe Ala Gly Phe Ala lie Met He Ser Leu Ala Leu Glu Trp Gly 
2^5 250 255 

Gly Ser Asp Tyr Ala Trp Asn Ser Ser Val lie lie Gly Leu Phe Cys 
260 265 270 

Ala Ala Gly Val Ser Leu Val Leu Phe Gly Cys Trp Glu Arg His Val 
275 280 285 

Gly Gly Ala Val Ala Met lie Pro lie Ser Val Ala Ser Arg Ara Gin 
290 295 300 

Val Trp Cys Ser Cys Phe Phe Leu Gly Phe Phe Ser Gly Ala Leu Leu 
305 310 315 320 

lie Phe Ser Tyr Tyr Leu Pro lie Tyr Phe Gin Ala Val Lys Asn Val 
325 330 335 

Ser Pro Thr Met Ser Gly Val Tyr Met Leu Pro Gly He Gly Gly Gin 
340 345 350 

lie Val Met Ala He Val Thr Gly Ala lie lie Gly Lys Thr Gly Tyr 
355 360 365 

Tyr Val Pro Trp Ala Leu Ala SerGly lie Leu Val Ser lie Ser Ala 
370 375 380 

Gly Leu Val Ser Thr Phe Gin Pro Glu Thr Ser He Ala Ala Trp Val 
385 390 395 400 

Met Tyr Gin Phe Leu Gly Gly Val Gly Arg Gly Cys Gly Met Gin Thr 
405 410 415 
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Pro Val Val Ala He Gin Asr. Ala Leu Pre Pre Glr. Thr Ser Pre He 
4 2 0 4 2 5 430 

Gly He Ser Leu Ala Met Phe Gly Glr. Thr Phe Gly Gly Ser Leu Phe 
4 35 440 445 

Leu Thr Leu Thr Glu Leu Val Phe Ser Asn Gly Leu Asd Ser Gly Leu 
450 455 460 

Arg Gin Tyr Ala Pre Thr Leu Asn Ala Gin Glu Val Thr Ala Ala Gly 
465 470 475 480 

Ala Thr Gly Phe Arg Gin Val Val Pro Ala Pro Leu He Ser Arg Val 
485 490 495 

Leu Leu Ala Tyr Ser Lys Gly Val Asp His Ala Phe Tyr Val Ala Val 
500 505 510 

Gly Ala Ser Gly Ala Thr Phe He Phe. Ala Trp Gly Met Gly Arg Leu 
515 520 525 

Ala Trp Arg Gly Trp Arg Met Gin Glu Lys Gly Arg Ser Glu 
530 535 540 

<210> 10 
<211> 2532 
<212> PRT 

<213> Aspergillus terreus 
<400> 10 

Met Thr Pro Leu Asp Ala Pro Gly Ala Pro Ala Pro He Ala Met Val 
15 10 15 

Gly Met Gly Cys Arg Phe Gly Gly Gly Ala Thr Asp Pro Gin Lys Leu 
20 25 30 

Trp Lys Leu Leu Glu Glu Gly Gly Ser Ala Trp Ser Lys lie Pro Pro 
35 40 45 

Ser Arg Phe Asn Val Gly Gly Val Tyr His Pro Asn Gly Gin Arg Val 
50 55 60 

Gly Ser Met His Val Arg Gly Gly His Phe Leu Asp Glu Asp Pro Ala 
65 70 75 80 

Leu Phe Asp Ala Ser Phe Phe Asn Met Ser Thr Glu Val Ala Ser Cys 
85 90 95 

Met Asp Pro Gin Tyr Arg Leu lie Leu Glu Val Val Tyr Glu Ala Leu 
100 105 110 

Glu Ala Ala Gly He Pro Leu Glu Gin Val Ser Gly Ser Lys Thr Gly 
115 120 125 

Val Phe Ala Gly Thr Met Tyr His Asp Tyr Gin Gly Ser Phe Gin Arg 
130 135 140 

Gin Pro Glu Ala Leu Pro Arg Tyr Phe lie Thr Gly Asn Ala Gly Thr 
145 150 155 160 

Met Leu Ala Asn Arg Val Ser His Phe Tvr Asp Leu Arg Gly Pro Ser 
165 170 175 

Val Ser lie Asp Thr Ala Cys Ser Thr Thr Leu Thr Ala Leu His Leu 
180 185 190 
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Ala lie Gin Ser Leu Arc Ala Gly Glu Ser Aso Met: Ala He Val Ala 
195 200 ' 2C5 

Gly Ala Asn Leu Leu Leu Asn Pro Asp Val Phe Thr T hr M^t Ser Asn 
210 215 22C 

Leu Gly Phe Leu Ser Ser Asp Gly He Ser Tyr Ser Pne Asp Se- Ara 
225 230 235 *" 2 4 0 

Ala Asp Gly Tyr Gly Arg Gly Glu Gly Val Ala Ala He Val Leu Lys 
245 250 255 

Thr Leu Pro Asp Ala Val Arg Asp Gly Asp Pro He Arg Leu He Val 
260 265 270 

Arg Glu Thr Ala lie Asn Gin Asp Gly Arg Thr Pro Ala He Ser Thr 
275 280 285 

Pro Ser Gly Glu Ala Gin Glu Cys Leu lie Gin Asp Cys Tyr Gin Lvs 
290 295 300 

Ala Gin Leu Asp Pro Lys Gin Thr Ser Tyr Val Glu Ala His Gly Thr 
305 310 315 320 

Gly Thr Arg Ala Gly Asp Pro Leu Glu Leu Ala Val lie Ser Ala Ala 
325 330 335 

Phe Pro Gly Gin Gin lie Gin Val Gly Ser Val Lys Ala Asn lie Gly 
3^0 345 350 

His Thr Glu Ala Val Ser Gly Leu Ala Ser Leu lie Lys Val Ala Leu 
355 360 365 

Ala Val Glu Lys Gly Val lie Pro Pre Asn Ala Arg Phe Leu Gin Pro 
370 375 380 

Ser Lys Lys Leu Leu Lys Asp Thr His lie Gin lie Pro Leu Cys Ser 
385 390 395 400 

Gin Ser Trp He Pro Thr Asp Gly Val Arg Arg Ala Ser He Asn Asn 
405 410 415 

Phe Gly Phe Gly Gly Ala Asn Ala His Ala lie Val Glu Gin Tyr Gly 
420 425 430 

Pro Phe Ala Glu Thr Ser He Cys Pro Pro Asn Gly Tyr Ser Gly Asn 
435 440 445 

Tyr Asp Gly Asn Leu Gly Thr Asp Gin Ala His lie Tyr Val Leu Ser 
450 455 460 

Ala Lys Asp Glu Asn Ser Cys Met Arg Met Val Ser Arg Leu Cys Asp 
465 470 475 480 

Tyr Ala Thr His Ala Arg Pro Ala Asp Asp Leu Gin Leu Leu Ala Asn 
485 490 495 

He Ala Tyr Thr Leu Gly Ser Arg Arg Ser Asn Phe Arg Trp Lys Ala 
500 505 510 ' 

Val Cys Thr Ala His Ser Leu Thr Gly Leu Ala Gin Asn Leu Ala Gly 
515 520 525 

Glu Gly Met Arg Pro Ser Lys Ser Ala Asp Gin Val Arg Leu Glv Trp 
530 535 540 
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Val Phe Thr Gly Gin Gly Ala Gin Trp Phe Ala Met Glv Arc Glu Leu 
545 550 555 " " 560 

lie Glu Met Tyr Pre Val Phe Lvs Glu Ala Leu Leu Glu Cys Asc Gly 
565 570 575 

Tyr lie Lys Glu Met Gly Ser Thr Trp Ser lie lie Glu Glu Leu Ser 
580 585 590 

Arg Pro Glu Thr Glu Ser Arg Val Asp Gin Ala Glu Phe Ser Leu Pro 
595 600 605 

Leu Ser Thr Ala Leu Gin He Ala Leu Val Arg Leu Leu Tro Ser Trp 
610 615 620 

Asn He Gin Pro Val Ala Val Thr Ser His Ser Ser Gly Glu Ala Ala 
625 630 635 640 

Ala Ala Tyr Ala He Gly Ala Leu Thr Ala Arg Ser Ala He Gly He 
645 650 655 

Ser Tyr He Arg Gly Ala Leu Thr Ala Arg Asp Arg Leu Ala Ser Val 
660 665 670 

His Lys Gly Gly Met Leu Ala Val Gly Leu Ser Arg Ser Glu Val Gly 
675 680 685 

He Tyr He Arg Gin Val Pro Leu Gin Ser Glu Glu Cys Leu Val Val 
690 695 700 

Gly Cys Val Asn Ser Pro Ser Ser Val Thr Val Ser Gly Asp Leu Ser 
705 710 715 720 

Ala He Ala Lys Leu Glu Glu Leu Leu His Ala Asp Arg He Phe Ala 
725 730 735 

Arg Arg Leu Lys Val Thr Gin Ala Phe His Ser Ser His Met Asn Ser 
740 745 750 

Met Thr Asp Ala Phe Arg Ala Gly Leu Thr Glu Leu Phe Gly Ala Asp 
755 760 765 

Pro Ser Asp Ala Ala Asn Ala Ser Lys Asp Val He Tyr Ala Ser Pro 
770 775 780 

Arg Thr Gly Ala Arg Leu His Asp Met Asn Arg Leu Arg Asp Pro lie 
785 790 795 800 

His Trp Val Glu Cys Met Leu His Pro Val Glu Phe Glu Ser Ala Phe 
805 810 815 

Arg Arg Met Cys Leu Asp Glu Asn Asp His Met Pro Lys Val Asp Arg 
820 825 830 

Val He Glu lie Gly Pro His Gly Ala Leu Gly Gly Pro He Lys Gin 
835 • 840 845 

He Met Gin Leu Pro Glu Leu Ala Thr Cys Asp lie Pro Tyr Leu Ser 
850 855 860 

Cys Leu Ser A.rg Gly Lys Ser Ser Leu Ser Thr Leu Arg Leu Leu Ala 
865 870 875 880 

Ser Glu Leu He Arg Ala Gly Phe Pro Val Asp Leu Asn Ala lie Asn 
885 890 895 
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Phe Pro Arg Giy C'vs Glu Ala Ala Ara Val Gin Val Leu Ser Asp Leu 
900 905 910 

Fro Pro Tyr Pro Trp Asn His Glu Thr Arc Tyr Tro Lys Glu Pro Ara 
915 920 ^ * 925 

lie Ser Gin Ser Ala Arg Gin Arc Lys Gly Pro Val His Asp Leu lie 
930 935 " 940 

Giy Leu Gin Glu Pro Leu Asn Leu Pro Leu Ala Arg Ser Trp His Asn 
945 950 955 960 

Val Leu Arg Val Ser Asp Leu Pro Trp Leu Arg Asp His Val Val Gly 
965 970 975 

Ser His lie Val Phe Pro Giy Ala Gly Phe Val Cys Met Ala Val Met 
980 985 990 

Gly lie Ser Thr Leu Cys Ser Ser Asp His Glu Ser Asp Asp He Ser 
995 1000 - 1005 

Tyr He Leu Arg Asp Val Asn Phe Ala Gin Ala Leu He Leu Pro Ala 
1010 1015 1020 

Asp Gly Glu Glu Gly He Asp Leu Arg Leu Thr He Cys Ala Pro Asp 
1025 1030 1035 1040 

Gin Ser Leu Giy Ser Gin Asp Trp Gin Arg Phe Leu Val His Ser lie 
1045 1050 1055 

Thr Ala Asp Lys Asn Asp Trp Thr Glu His Cys Thr Gly Leu Val Arg 
1060 1065 1070 

Ala Glu Met Asp Gin Pro Pro Ser Ser Leu Ser Asn Gin Gin Arg lie 
1075 1080 1085 

Asp Pro Arg Pro Trp Ser Arg Lys Thr Ala Pro Gin Glu Leu Trp Asp 
1090 1095 1100 

Ser Leu His Arg Val Gly He Arg His Gly Pro Phe Phe Arg Asn lie 
1105 1110 1115 1120 

Thr Cys lie Glu Ser Asp Gly Arg Gly Ser Trp Cys Thr Phe Ala lie 
1125 1130 1135 

Ala Asp Thr Ala Ser Ala Met Pro His Ala Tyr Glu Ser Gin His lie 
1140 1145 1150 

Val His Pro Thr Thr Leu Asp Ser Ala Val Gin Ala Ala Tyr Thr Thr 
1155 1160 1165 

Leu Pro Phe Ala Gly Ser Arg lie Lys Ser Ala Met Val Pro Ala Arg 
1170 1175 li80 

Val Gly Cys Met Lys lie Ser Ser Arg Leu Ala Asp Leu Glu Ala Arg 
1185 1190 . 1195 1200 

Asp Met Leu Arg Ala Gin Ala Lys "Met His Ser Gin Ser Pro Ser Ala 
1205 1210 1215 

Leu Val Thr Asp Val Ala Val Phe Asp Glu Ala Asp Pro Val Gly Gly 
1220 1225 1230 

Pro Val Met Glu Leu Glu Gly Leu Val Phe Gin Ser Leu Gly Ala Ser 
1235 1240 1245 
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Leu Glv Thr Ser Asp Ara Aso Ser Thr Asp Pre Gly Asn Thr Cys Ser 
1250 "1255 1260 

Ser Trp His Trp Ala Fro Asp lie Ser Leu Val Asn Pro Gly Trc Leu 
1265 1270 1275 * 1280 

Glu Lys Thr Leu Glv Thr Gly lie Gin. Glu His Giu lie Ser Leu lie 
1285 1290 1295 

Leu Giu Leu Arg Arg Cys Ser Val His Phe lie Gin Glu Ala Met Glu 
1300 1305 1310 

Ser Leu Ser Val Gly Aso Val Glu Arg Leu Ser Gly His Leu Ala Lys 
1315 1320 1325 

Phe Tyr Ala Trp Met Gin Lys Gin Leu Ala Cys Ala Gin Asn Gly Glu 
1330 1335 1340 

Leu Gly Pro Glu Ser Ser Ser Trc Thr Arg Asp Ser Glu Gin Ala Arg 
1345 1350 1355 1360 

Cys Ser Leu Arg Ser Arg Val Val Ala Gly Ser Thr Asn Gly Glu Met 
1365 1370 1375 

lie Cys Arg Leu Gly Ser Val Leu Pro Ala lie Leu Ara Arg Glu Val 
1380 1385 '1390 

Asp Pro Leu Glu Val Met Met Asp Gly His Leu Leu Ser Arg Tyr Tyr 
1395 1400 1405 

Val Asp Ala Leu Lys Trp Ser Arg Ser Asn Ala Gin Ala Ser Glu Leu 
1410 1415 1420 

Val Arg Leu Cys Cys His Lys Asn Pro Arg Ala Arg lie Leu Glu lie 
1425 1430 1435 1440 

Gly Gly Gly Thr Gly Gly Cys Thr Gin Leu Val Val Asp Ser Leu Gly 
1445 1450 1455 

Pro Asn Pro Pro Val Gly Arg Tyr Asp Phe Thr Asp Val Ser Ala Gly 
1460 1465 1470 

Phe Phe Glu Ala Ala Arg Lys Arg Phe Ala Gly Trp Gin Asn Val Met 
1475 1480 1485 

Asp Phe Arg Lys Leu Asp He Glu Asp Asp Pro Glu Ala Gin Gly Phe 
1490 1495 1500 

Val Cys Gly Ser Tyr Asp Val Val Leu Ala Cys Gin Val Leu His Ala 
1505 1510 1515 1520 

Thr Ser Asn Met Gin Arg Thr Leu Thr Asn Val Arg Lys Leu Leu Lys 
1525 1530 1535 

Pro Gly Gly Lys Leu He Leu Val Glu Thr Thr Arg Asp Glu Leu Asp 
1540 1545 1550 

Leu Phe Phe Thr Phe Gly Leu Leu Pro Gly Trp Trp Leu Ser Glu Glu 
1555 1560 1565 

Pro Glu Arg Gin Ser Thr Pro Ser Leu Ser Pro Thr Met Trp Arg Ser 
1570 1575 1580 

Met Leu His Thr Thr Gly Phe Asn Gly Val Glu Val Glu Ala Arg Asp 
1585 1590 1595 1600 
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Cys Asp Ser His Glu Phe Tyr Met lie Ser Thr Met Met Ser Thr Ala 
1605 1610 1615 

Val Gin Ala Thr Pro Met Ser Cys Ser Val Lys Leu Pro Glu Val Leu 
1620 1625 1630 

Leu Val Tyr Val Asp Ser Ser Thr Pro Met Ser Trp He Ser Asp Leu 
1635 1640 1645 

Gin Gly Glu He Arg Gly Arg Asn Cys Ser Val Thr Ser Leu Gin Ala 
1650 1655 1660 

Leu Arg Gin Val Pro Pro Thr Glu Gly Gin He Cys Val Phe Leu Gly 
1665 1670 1675 1680 

Glu Val Glu His Ser Met Leu Gly Ser Val Thr Asn Asp Aso Phe Thr 
1685 1690 ' ' 1695 

Leu Leu Thr Ser Met Leu Gin Leu Ala Gly Gly Thr Leu Tro Val Thr 
1700 1705 1710 

Gin Gly Ala Thr Met Lys Ser Asp Asp Pro Leu Lys Ala Leu His Leu 
1715 1720 1725 

Gly Leu Leu Arg Thr Met Arg Asn Glu Ser His Gly Lys Arq Phe Val 
1730 1735 1740 

Ser Leu Asp Leu Asp Pro Ser Arg Asn Pro Trp Thr Gly Asp Ser Arq 
1745 1750 1755 1760 

Asp Ala He Val Ser Val Leu Asp Leu He Ser Met Ser Asp Glu Lys 
1765 1770 1775 

Glu Phe Asp Tyr Ala Glu Arg Asp Gly Val He His Val Pro Arg Ala 
1780 1785 1790 

Phe Ser Asp Ser He Asn Gly Gly Glu Glu Asp Gly Tyr Ala Leu Glu 
1795 1800 1805 

Pro Phe Gin Asp Ser Gin His Leu Leu Arg Leu Asp He Gin Thr Pro 
1810 1815 1820 

Gly Leu Leu Asp Ser Leu His Phe Thr Lys Arg Asn Val Asp Thr Tyr 
1825 1830 1835 1840 

Glu Pro Asp Lys Leu Pro Asp Asp Trp Val Glu He Glu Pro Arg Ala 
1845 1850 1855 

Phe Gly Leu Asn Phe Arg Asp He Met Val Ala Met Gly Gin Leu Glu 
I860 1865 1870 

Ser Asn Val Met Gly Phe Glu Cys Ala Gly Val Val Thr Ser Leu Ser 
1875 1880 1885 

Glu Thr Ala Arg Thr He Ala Pro Gly Leu Ala Val Gly Asp Arg Val 
1890 1895 1900 

Cys Ala Leu Met Asn Gly His Trp Ala Ser Arg Val Thr Thr Ser Arq 
1905 1910 1915 1920 

Thr Asn Val Val Arg He Pro Glu Thr Leu Ser Phe Pro His Ala Ala 
1525 1930 1935 

Ser He Pro Leu Ala Phe Thr Thr Ala Tyr He Ser Leu Tyr Thr Val 
1940 1945 1950 
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Ala Arg lie Leu Pro Glv Glu Thr Val Leu lie His Ala Gly Ala Giy 
1955 - 1960 1965 

Gly Val Gly Gin Ala Aia lie lie Leu Ala Gin Leu Thr Gly Ala Glu 
1970 1975 1980 

Val Phe Thr Thr Ala Giy Ser Glu Thr Lys Arg Asn Leu Leu lie Asp 
1985 1990 1995 2000 

Lys Phe His Leu Asp Pro Asp His Val Phe Ser Ser Arg Asp Ser Ser 
2005 2010 2015 

Phe Val Asp Gly lie Lys Thr Arg Thr Ara Giy Lys Gly Val Asp Val 
2020 2025 2030 

Val Leu Asn Ser Leu Ala Gly Pro Leu Leu Gin Lvs Ser Phe Asp Cys 
2035 2040 " 2045 

Leu Ala Arg Phe Gly Arg Phe Val Glu lie Gly Lys Lys Asp Leu Glu 
2050 2055 2060 

Gin Asn Ser Arg Leu Asp Met Ser Thr Phe Val Arg Asn Val Ser Phe 
2065 2070 2075 2080 

Ser Ser Val Asp He Leu Tyr Trp Gin Gin Ala Lys Pro Ala Glu He 
2085 2090 2095 

Phe Gin Ala Met Ser Glu Val He Leu Leu Trp Glu Arg Thr Ala He 
2100 2105 2110 

Gly Leu He His Pro He Ser Glu Tyr Pro Met Ser Ala Leu Glu Lys 
2115 2120 2125 

Ala Phe Arg Thr Met Gin Ser Gly Gin His Val Gly Lys lie Val Val 
2130 2135 . 2140 

Thr Val Ala Pro Asp Asp Ala Val Leu Val Arg Gin Glu Arg Met Pro 
2145 2150 2155 2160 

Leu Phe Leu Lys Pro Asn Val Ser Tyr Leu Val Ala Gly Gly Leu Gly 
2165 2170 2175 

Gly He Gly Arg Arg He Cys Glu Trp Leu Val Asp Arg Gly Ala Arg 
2180 2185 2190 

Tyr Leu He He Leu Ser Arg Thr Ala Arg Val Asp Pro Val Val Thr 
2195 2200 2205 

Ser Leu Gin Glu Arg Gly Cys Thr Val Ser Val Gin Ala Cys Asp Val 
2210 2215 2220 

Ala Asp Glu Ser Gin Leu Glu Ala Ala Leu Gin Gin Cys Arg Ala Glu 
2225 2230 2235 2240 

Glu Met Pro Pro He Arg Gly Val He Gin Gly Ala Met Val Leu Lys 
2245 2250 2255 

Asp Ala Leu Val Ser Gin Met Thr Ala Asp Gly Phe His Ala Ala Leu 
2260 2265 2270 

Arg Pro Lys Val Gin Gly Ser Trp Asn Leu His Arg He Ala Ser Asp 
2275 2280 2285 

Val Asp Phe Phe Val Met Leu Ser Ser Leu Val Gly Val Met Gly Gly 
2290 2295 2300 
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Ala Giy Gin Ala Asn Tvr Ala Ala Ala Gly Ala Phe Gin Asp Ala Leu 
2305 231C 2315 2320 

Ala Glu His Arg Met Ala His Asn Gin Pro Ala Val Thr He Asp Leu 
2325 2330 2335 

Gly Met Val Gin Ser He Gly Tyr Val Ala Glu Thr Asp Ser Ala Val 
2 3 4 C 23 4 5 2 350 

Ala Glu Arg Leu Gin Arg He Giy Tyr Gin Pro Leu His Glu Glu Glu 
2355 2360 2365 

Val Leu Asp Val Leu Glu Gin Ala He Ser Pro Val Cys Ser Pro Ala 
2370 2375 2380 

Ala Pro Thr Arg Pro Ala Val lie Val Thr Gly lie Asn Thr Arg Pro 
2385 2390 2395 2400 

Gly Pro His Trp Ala His Ala Asp Trp Met Gin Glu Ala Arg Phe Ala 
2405 2410 2415 

Gly He Lys Tyr Arg Asp Pro Leu Arg Asp Asn His Gly Ala Leu Ser 
2420 2425 2430 

Leu Thr Pro Ala Glu Asp Asp Asn Leu His Ala Arg Leu Asn Arg Ala 
2435 2440 2445 

lie Ser Gin Gin Glu Ser lie Ala Val lie Met Glu Ala Met Ser Cys 
2450 2455 2460 

Lys Leu lie Ser Met Phe Gly Leu Thr Asp Ser Glu Met Ser Ala Thr 
2465 2470 2475 2480 

Gin Thr Leu Ala Gly lie Gly Val Asp Ser Leu Val Ala He Glu Leu 
2485 2490 2495 

Arg Asn Trp He Thr Ala Lys Phe Asn Val Asp He Ser Val Phe Glu 
2500 2505 2510 

Leu Met Glu Gly Arg Thr lie Ala Lys Val Ala Glu Val Val Leu Gin 
2515 2520 2525 

Arg Tyr Lys Ala 
2530 



<210> 11 
<211> 249 
<212> PRT 

<213> Aspergillus terreus 
<400> 11 

Met Ala Thr Gin Glu Phe Leu Ser Asp Val Ser Ser Gly Phe Leu Ser 
15 10 15 

Ala Glu Ala lie Ara Tyr Arg Val Lys Thr Gly Val Ser Met Asp Gly 
20 25 30 

Trp Met Lys Arg Gly Tyr Ser Cys Asn Ser Val Arg Thr Asp Asp Lys 
35 40 45 

His His Leu Arg His Leu Thr Asn lie Gly Leu Asp Thr Pro Pro Cys 
50 55 60 

Pro Lys Ser Leu Pro Ala Ala His Ser Ala Val Ala Ser Cys Leu Thr 
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Phe Val Pre Fro Asp Pro Cvs Glu Asn Tro GIu Ala Leu Gin Val Ala 
8 5 90 9 5 

Trp Asp Lys Ala Cys Cys Ara Asn Pro Thr Pro Leu Phe Phe lie Cys 
100 105 110 

Val Ser Leu Leu Phe Ser Phe Tvr Ser Leu Trp Leu Gin Arg Gly Gly 
115 120 125 

Cys Gly Arg Tyr Gly Gly Leu His Arg Val Ser Lys Val Phe Pro Lys 
130 135 140 

Val Trp Pro Asp Asp Met Asp Ser Gin Leu Pro Ger Arg Leu Gin Thr 
145 150 155 160 

Leu Val Ser Lys Arg Lys Pro GIu Pro Ala Pro Asn Asn Ser Thr Tyr 
165 170 175 

lie Ser Lys Gly Tyr Ala Thr Phe Phe Asn Gin Phe Ser Leu Pro Ser 
180 185 190 

Val Asp Val Thr Gin lie Leu Asn Gin Thr Leu Gin His His Asp Val 
195 200 205 

Glu Thr lie Asn Leu Asp Cys Gly Ser Gly Leu Leu Thr Leu Arg Thr 
210 215 220 

Gin Leu Arg lie Leu Leu lie Gly Lys Pro Lys lie lie Lys Pro Phe 
225 230 235 240 

Ser Gly Leu Arg Thr Ser lie Asn GIu 



<210> 12 
<211> 742 
<212> PRT 

<213> Aspergillus terreus 
<400> 12 

Met GIu Ser Ala GIu Leu Ser Ser Lys Arg Gin Ala Phe Pro Ala Cys 
15 10 15 

Asp GIu Cys Arg lie Arg Lys Val Arg Cys Ser Lys GIu Gly Pro Lys 
20 25 30 

Cys Ser His Cys Leu Arg Tyr Asn Leu Pro Cys GIu Phe Ser Asn Lys 
35 40 45 

Val Ala Arg Asp Val GIu Lys Leu Gly Ser Arg Val Gly Asp He GIu 
50 55 60 

His Ala Leu Gin Arg Cys Leu Ser Phe He Asp Ala His Gin Gly Phe 
65 70 75 80 

Arg Asp Leu Ser Arg Pro Gin Ser Gin GIu Ser Gly Tyr Thr Ser Ser 
85 90 95 

Thr Ser Ser GIu Glu Cys Glu Val Asn Leu Tyr Ser Gly Lys His Thr 
100 105 110 

Ser Pro Thr Glu Glu Aso Gly Phe Trp Pro Leu His Gly Tyr Gly Ser 
115 120 125 

Phe Val Ser Leu Val Met Glu Ala Gin Ala Ala Asn Ala Asn Leu Thr 
130 135 140 
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Ser Trp Leu Pro Val Asp Met Thr Ser Glv Gin Val Ala Glu Met Val 
145 150 155 160 

Ala Phe Asp Arg Gin Ala Val Ser Ala Val Arg Ser Lys Val Ala Glu 
165 170 175 

Ala Asn Glu Thr Leu Gin Gin lie lie Glu Asp lie Pro Thr Leu Ser 
180 185 190 

Ala Ser Glu Asn Asp Thr Phe Leu Pro Ser Leu Pro Pro Arg Ala Leu 
195 200 205 

Val Glu Pro Ser lie Asn Glu Tyr Phe Lys Lys Leu His Pro Arg Leu 
210 215 220 

Pro lie Phe Ser Arg Gin Thr lie Met Asp Ala Val Glu Ser Gin Tyr 
225 230 235 240 

Thr lie Arg Thr Gly Pro Pro Asp Leu Val Trp lie Thr Ser Phe Asn 
245 250 255 

Cys lie Val Leu Gin Ala Leu Thr Gin Thr Ser lie Ala Asn Lys Val 
260 265 270 

Val Gly Cys Thr Gly Gin Asp lie Pro lie Asp Tyr Met lie lie Ser 
275 280 285 

Leu Leu Arg Asn lie Arg Gin Cys Tyr Asn Arg Leu Glu Thr Leu Val 
290 295 300 

Lys Pro Arg Leu Ser Asn lie Arg Ala Leu Phe Cys Leu Ala Leu Val 
305 310 315 320 

Ala Met Giu Tyr Phe Asp Phe Ala lie Phe Leu Thr lie Phe Ala Gin 
325 330 335 

Val Cys Glu Leu Ser Arg Leu lie Gly Leu His Leu Thr Thr Thr Thr 
340 345 350 

Pro Pro Thr Giu .ASp Gly Ala Val Gly Asp Gin Pro Lys Asp Leu Phe 
355 360 365 

Trp Ser lie Phe Leu Val Asp Lys His Val Ser lie lie Gly Gly Lys 
370 375 380 

Ala Cys Leu Leu Pro Ser Tyr Asp Cys Ser Val Pro Leu Pro Pro Tyr 
385 390 395 400 

Asp Ser Ala Ala Pro Leu Pro Asn Ala Phe Ala Ala Arg lie Arg Leu 
405 410 415 

Ala Phe lie Leu Glu Glu lie Tyr Leu Gly Leu Tyr Ser Ala Lys Ser 
420 425 430 

Ser Lys Met Glu Gin Ser Arg Val Arg Arg Arg lie Arg Arg lie Ala 
435 440 445 

Arg Lys Leu Ser Gin Trp His Val Gin His Glu His Val Leu Arg Thr 
450 455 460 

Gly Asp Pro Asn Arg Pro Leu Giu Glu Tyr lie Cys Ala Thr Gin Leu 
465 470 475 480 

Arg Phe Ala Leu Ser Ser Cys Trp Val Leu Leu His Lys Arg lie Trp 
485 490 495 
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Ser Gin Glu Arg Gly Ala Val Cys Leu Gin His Ala Arg Asp Cys Leu 
500 505 510 

Met Leu Phe Lys Gin Leu Cys Asp Gly Cys Lys Ser Gly Phe Ser Asn 
515 520 525 

Phe Asp Ser He Val Leu Asn Tyr Ser Leu He Ser Phe Met Gly He 
530 535 540 

Tyr Vai His He Val Glu Glu Asp Gin Pro lie His Ser Gin Asp Met 
545 550 555 560 

Glu He Leu Thr Phe Phe Ala lie Tyr Thr Asn Arg Ser Ala Ser Asn 
565 570 575 

Arg Ser Ser Ala Ser lie Ser Tyr Lys Leu Ser Gin Val Ala Ser Arg 
580 585 590 

Cys Ser Asp He Ala Leu Leu Leu Gin- Asn Leu Arg Glu Arg Arg Phe 
595 600 605 

He Pro Thr Thr lie Ser Arg Ser Pro Thr Pro Ser Trp Asn Glu Pro 
610 615 620 

Thr Tyr Met Asp Tyr Asp Val Ala Asn Ala Ser Thr Ser Thr Thr Ser 
625 630 635 640 

Thr Gly Ser Ser Tyr Asn Leu Asn lie Ser Pro Leu Gly Val Pro Gly 
645 650 655 

Asp Gly Gin Val Trp Asp lie Tyr Phe Asn Pro Arg Glu lie Pro Met 
660 665 670 

Asp Gly Thr lie Ala Thr Pro Ser Glu Asp Ala Thr Gin Asp Leu Leu 
675 680 685 

Ser Asn Asp Ala Gly Gin Cys Leu Gly Phe Pro Asp Phe Ser Leu Gly 
690 695 700 

He Asp Asn Phe Ser Asp Phe Pro Leu Gly lie Asp Met Thr Ser Gin 
705 710 715 720 

Ser Glu Phe Gly Leu lie Met Glu Glu Asp lie lie Arg Tyr Glu Arg 
725 730 735 

Leu Leu Asp Arg Pro Val 
740 

<210> 13 
<211> 301 
<212> PRT 

<213> Aspergillus terreus 
<400> 13 

Met Glu Ser Lys Val Gin Thr Asn Val Pro Leu Pro Lys Ala Pro Leu 
15 10 15 

Thr Gin Lys Ala Arg Gly Lys Arg Thr Lys Gly He Pro Ala Leu Val 
20 25 30 

Ala Gly Ala Cys Ala Gly Ala Val Glu lie Ser lie Thr Tyr Pro Phe 
35 40 45 

Glu Ser Ala Lys Thr Arg Ala Gin Leu Lys Arg Arg Asn His Asp Val 
50 55 60 
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Ala Ala lie Lys Pro Gly lie Arc GIv Trc Tyr Ala Gly Tvr GI v Ala 
65 " 7 0 7 5 S 0 

Thr Leu Val Gly Thr Thr Leu Lys Ala Ser Val Gin Phe Ala Ser Phe 
85 90 95 

Asp. He Tyr Arg Ser Ala Leu Ser Gly Pro Asn Gly Glu Leu Ser Thr 
100 105 110 

GIv Ala Ser Val Leu Ala Gly Phe Gly Ala Gly Val Thr Glu Ala Val 
115 120 125 

Leu Ala Val Thr Pro Ala Glu Ala lie Lys Thr Lys He He Asd Ala 
130 135 140 

Arg Lys Val Gly Asn Ala Glu Leu Ser Thr Thr Phe Gly Ala He Ala 
145 ISO 155 160 

Gly He Leu Arg Asp Arg Gly Pro Leu Gly Phe Phe Ser Ala Val Gly 
165 170 175 

Pro Thr lie Leu Arg Gin Ser Ser Asn Ala Ala Val Lys Phe Thr Val 
180 185 190 

Tyr Asn Glu Leu lie Gly Leu Ala Arg Lys Tyr Ser Lys Asn GIv Glu 
195 200 205 

Asp Val His Pro Leu Ala Ser Thr Leu Val Gly Ser Val Thr Gly Val 
210 215 220 

Cys Cys Ala Trp Ser Thr Gin Pro Leu Asp Val He Lys Thr Arg Met 
225 230 235 240 

Gin Ser Leu Gin Ala Ara Gin Leu Tyr Gly Asn Thr Phe Asn Cys Val 

245 250 255 

Lys Thr Leu Leu Arg Asn Glu Gly lie Gly Val Phe Trp Ser Gly Val 
260 265 270 

Trp Phe Arg Thr Gly Arg Leu Ser Leu Thr Ser Ala He Met Phe Pro" 
275 280 285 

Val Tyr Glu Lys Val Tyr Lys Phe Leu Thr Gin Pro Asn 
290 295 300 



<210> 14 
<211> 490 
<212> PRT 

<213> Aspergillus terreus 
<400> 14 

Met Thr Lys Gin Ser Ala Asp Ser Asn Ala Lys Ser Gly Val Thr Ala 
15 10 15 

Glu He Cys His Trp Ala Ser Asn Leu Ala Thr Asp Asp lie Pro Pro 
20 25 30 

Asp Val Leu Glu Arg Ala Lys Tyr Leu He Leu Asp Gly He Ala Cys 
35 40 45 

Ala Trp Val Gly Ala Arg Val Pro Trp Ser Glu Lys Tyr Val Gin Ala 
50 55 60 

Thr Met Ser Phe Glu Pro Pro Gly Ala Cys Arg Val lie GIv Tyr Gly 
65 70 75 " 80 
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Gin Lys Leu Gly Pro Vai Ala Ala Ala Met Thr Asn Ser Ala Phe lie 
Si 90 5 5 

Gin Ala Thr Glu Leu Asc Aso Tvr His Ser Glu Ala Pro Leu His Ser 
10C * 105 110 

Ala Ser lie Val Leu Pro Ala Val Phe Ala Ala Ser Glu Vai Leu Ala 
115 120 125 

Glu Gin Gly Lys Thr lie Ser Gly He Ala Val lie Leu Ala Ala He 
130 135 140 

Val Gly Phe Glu Ser Gly Pro Arg He Gly Lys Ala He Tyr Gly Ser 
145 150 155 160 

Asp Leu Leu Asn Asn Gly Trp His Cys Gly Ala Val Tyr Gly Ala Pro 
165 170 175 

Ala Gly Ala Leu Ala Thr Gly Lys Leu Leu Gly Leu Thr Pro Asp Ser 
180 185 190 

Met Glu Asp Ala Leu Gly lie Ala Cys Thr Gin Ala Cys Gly Leu Met 
195 200 205 

Ser Ala Gin Tyr Gly Gly Met Val Lys Arg Val Gin His Gly Phe Ala 
210 215 220 

Ala Arg Asn Gly Leu Leu Gly Gly Leu Leu Ala His Gly Gly Tyr Glu 
225 230 235 240 

Ala Met Lys Gly Val Leu Glu Arg Ser Tyr Gly Gly Phe Leu Lys ' Met 
245 250 255 

Phe Thr Lys Gly Asn Gly Arg Glu Pro Pro Tyr Lys Glu Glu Glu Val 
260 265 270 

Val Ala Gly Leu Gly Ser Phe Trp His Thr Phe Thr lie Arg He Lvs 
275 280 285 

Leu Tyr A.la Cys Cys Gly Leu Val His Gly Pro Val Glu Ala lie Glu 
290 295 300 

Asn Leu Gin Arg Arg Tyr Pro Glu Leu Leu Asn Arg Ala Asn Leu Ser 
305 310 315 320 

Asn lie Arg His Val His Val Gin Leu Ser Thr Ala Ser Asn Ser His 
325 330 335 

Cys Gly Trp lie Pro Glu Glu Arg Pro lie Ser Ser lie Ala Gly Gin 
340 345 350 

Met Ser Val Ala Tyr lie Leu Ala Val Gin Leu Val Asp Gin Gin Cys 
355 360 365 

Leu Leu Ala Gin Phe Ser Glu Phe Asp Asp Asn Leu Glu Arg Pro Glu 
370 375 380 

Val Trp Asp Leu Ala Arg Lys Val Thr Pro Ser His Ser Glu Glu Phe 
385 390 395 400 

Asp Gin Asp Gly Asn Cys Leu Ser Ala Gly Arg Val Arg lie Glu Phe 
405 410 415 

Asn Asp Gly Ser Ser Val Thr Glu Thr Val Glu Lys Pro Leu Gly Val 
420 425 430 
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Lvs Glu Pre Met Pre Asn 
4 35 

Ala Gly Ser Val Tr.r Asc 
450 

Val Leu Ser Leu Asd Arc 
465 " 470 

Leu Asn Cys Pro Val Lys 
455 



Glu Aro lie Leu His Lys 
44 0 

Glu Thr Arg Val Lys Glu 
455 " 460 

Leu Thr Asp He Ser Pro 
475 

Ser Pro Leu Val 
490 



PCT/US99/29583 

Tyr Arg Thr Leu 
44 5 

He Glu Asp Leu 

Leu Leu Glu Leu 
. 480 



<210> 15 
<211> 488 
<212> PRT 

<213> Aspergillus terreus 
<400> 15 

Met Gly Arg Gly Asp Thr Glu Ser Pro Asn Pro Ala Thr Thr Ser Glu 
15 10 15 

Gly Ser Gly Gin Asn Glu Pro Glu Lys Lys Gly Arg Asp He. Pro Leu 
20 25 30 

Trp Arg Lys Cys Val He Thr Phe Val Val Ser Trp Met Thr Leu Val 
35 40 45 

Val Thr Phe Ser Ser Thr Cys Leu Leu Pro Ala Ala Pro Glu He Ala 
50 55 60 

Asn Glu Phe Asp Met Thr Val Glu Thr lie Asn He Ser Asn Ala Gly 
65 70 75 80 

Val Leu Val Ala Met Gly Tyr Ser Ser Leu He Trp Gly Pro Met Asn 

85 90 95 

Lvs Leu Val Gly Arg Ara Thr Ser Tyr Asn Leu Ala lie Ser Met Leu 
100 105 110 

Cys Ala Cys Ser Ala Gly Thr Ala Ala Ala lie Asn Glu Lys Met Phe 
115 120 125 

lie Ala Phe Arg Val Leu Ser Gly Leu Thr Gly Thr Ser Phe Met Val 
130 135 140 

Ser Gly Gin Thr Val Leu Ala Asp lie Phe Glu Pro Val Tyr Arg Gly 
145 150 155 160 

Thr Ala Val Gly Phe Phe Met Ala Gly Thr Leu Ser Gly Pro Ala lie 
165 170 175 

Gly Pro Cys Val Gly Gly Val lie Val Thr Phe Thr Ser Trp Arg Val 
180 185 190 

lie Phe Trp Leu Gin Leu Gly Met Ser Gly Leu Gly Leu Val Leu Ser 
195 200 205 

Leu Leu Phe Phe Pro Lys lie Glu Gly Thr Ser Glu Lys Val Ser Thr 
210 215 220 

Ala Phe Lys Pro Thr Thr Leu Val Ser lie He Ser Lys Phe Ser Pro 
225 230 235 240 

Thr Asp Val Leu Lys Gin Trp Val Tvr Pro Asn Val Phe Leu Ala Val 
245 250 255 
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Ser Ala Trp Glu lie Cys Fro Leu His Leu Leu Glu Thr Lvs Cys Ser 
260 265 270 

Cys Arg Lvs Gin Lys Asp Leu Cvs Cys Gly Leu Leu Ala lie Thr Gin 
275 280 285 

Tyr Ser lie Leu Thr Ser Ala Arg Ala lie Phe Asn Ser Arg Phe His 
290 295 300 

Leu Thr Thr Ala Leu Vai Ser Gly Leu Phe Tvr Leu Ala Pro Gly Ala 
305 310 315 320 

Gly Phe Leu lie Gly Ser Leu Val Gly Gly Lys Leu Ser Asp Arg Thr 
325 330 335 

Val Arg Arg Tyr lie Val Lys Arg Gly Phe Arg Leu Pro Gin Asp Arg 
340- 345 350 

Leu His Ser Gly Leu lie Thr Leu Phe. Ala Val Leu Pro Ala Gly Thr 
355 360 365 

Leu lie Tyr Gly Trp Thr Leu Gin Glu Asp Lys Gly Gly Met Val Val 
370 375 380 

Pro lie lie Ala Ala Phe Phe Ala Gly Trp Gly Leu Met Gly Ser Phe 
385 390 395 400 

Asn Cys Leu Asn Thr Tyr Val Ala Val Glu Ala Leu Pro Arg Asn Arg 
405 410 415 

Ser Ala Val He Ala Gly Lys Tyr Met He Gin Tyr Ser Phe Ser Ala 
420 425 430 

Gly Ser Ser Ala Leu Val Val Pro Val He Asp Ala Leu Gly Val Gly 
435 440 445 

Trp Thr Phe Thr Leu Cys Val Val Ala Ser Thr He Ala Gly Leu He 
450 455 460 

Thr Ala Ala He Ala Arg Trp Gly He Asn Met Gin Arg Trp Ala Glu 
465 470 475 480 

Arg Ala Phe Asn Leu Pro Thr Gin 



<210> 16 
<211> 516 
<212> PRT 

<213> Aspergillus terreus 
<400> 16 

Met Thr Leu Gin He He Val He Ala Ala Thr Ala Val He Tyr Phe 
15 10 15 

Leu Thr Arg Tyr Phe Asn Arg Thr Asp lie Pro Lys lie Lys Gly He 
20 25 30 

Pro Glu lie Pro Gly Val Pro He Phe Gly Asn Leu lie Gin Leu Gly 
35 40 45 

Val Lys His Ala Thr Val Ala Arg Lys Trp Ser Lys Glu Phe Gly Pro 
50 55 60 

Val Phe Gin Ala Arg Leu Gly Asn Arg Arg Val lie Phe Ala Asn Thr 
65 70 75 80 
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Phe Glu Ser Thr Arg Gin Leu Trp He Lvs Glu Gin Ser Ser Me: He 
85 90 95 

Ser Arg Pro Thr Phe His Thr Phe His Gly Val Val Ser Ser Ser Gin 
100 105 110 

Gly Phe Thr He Gly Thr Ser Pro Trp Asp Glu Ser Cys Lys Arg Ara 
115 120 125 

Arg Lys Ala Ala Ala Thr Ala Leu Asn Arg Val Ala Val Gin Ser Tyr 
130 135 140 

Met Pro He He Asp Leu Glu Ser Met Ala Ser He Lys Glu Leu Leu 
145 150 155 160 

Lys Asp Ser Gin Gly Gly Lys He Asp lie Asn Pro Thr Pro Tyr Phe 
165 170 175 

Gin Arg Phe Ala Leu Asn Thr Ser Leu . Thr Leu Asn Tyr Gly Tyr Arg 
180 185 190 

He Glu Gly Asn Val Asn Asp Gin Leu Leu Arg Glu He Cys Glu Val 
195 200 205 

Gin Arg Gly Val Ala Asn Leu Arg Ser Thr Ser Asn Asn Trp Gin Asp 
210 215 220 

Tyr Val Pro Leu Leu Arg Leu Phe Ser Asn Arg Ser Asn Gin Ala Lys 
225 230 235 240 

His Leu Arg Ala Arg Arg Asp Lys Tyr Met Ala Phe Leu Phe Asp He 
245 250 255 

Leu Lys Asp Arg Met Ala Lys Gly Thr Asp Lys Pro Cys He Thr Gly 
260 265 270 

Asn He Leu Lys Asn Pro Glu Thr Lys Leu Thr Asd Ala Glu He Lys 
275 280 * 285 

Ser He Cys Leu Thr Met Val Ser Ala Gly Leu Asp Thr Val Pro Gly 
290 295 300 

Asn Leu He Met Gly He Ala Tyr Leu Ser Ser Glu Asp Gly Gin Arg 
305 310 315 320 

lie Gin Gin Lys Ala Tyr Glu Glu lie Met Ser Val Tyr Pro Asn Gly 
325 330 335 

Asp Ala Trp Glu Arg Cys Leu Val Glu Glu Lys Val Pro Tyr lie Thr 
340 345 350 

Ala Leu Val Lys Glu Thr Leu Arg Phe Trp Thr Val Met Pro lie Cys 
355 360 365 

He Pro Arg Val Asn lie Lys Glu Val He Tyr Asn Gly Ala Arg lie 
370 375 380 

Pro Ala Gly Thr Thr Phe Phe Met Asn Ala Trp Ala Ala Asn Tyr Asp 
385 390 395 400 

Glu Asp His Phe Asp Met Pro Asn Arg Phe Leu Pro Glu Arg Tyr Leu 
405 410 415 

Glu Pro Ser Glu Gly Phe Gly Thr Pro His Tyr Ser Phe Gly Ala Gly 
420 425 430 
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Thr Arg Met Cys Ala Ala Ser His Leu Ala Ser Arg Glu Leu Tyr Thr 

435 440 445 

Val Phe Leu Arg Phe lie Val Ala Phe Thr lie Glu Pro Ala Gin Asn 

450 455 460 

Pro Ala Asp Met Pro Val Leu Asp Ala lie Glu Cys Asn Ala Thr Pro 

465 470 475 480 

Thr Ser Met Thr Thr Glu Pro Lys Pro Phe Lys Val Gly Phe Lys Pro 
485 490 " 495 

Arg Asp Glu Thr Ser Leu Arg Arg Trp lie Ala Glu Ser Glu Glu Arg 

500 505 510 



Thr Lys Glu Leu 



<210> 17 
<211> 481 
<212> PRT 

<213> Aspergillus terreus 
<400> 17 

Met Lys Pro Ala He Leu Met Lys Tyr Trp Leu Phe Val Ser Ala Val 
15 10 15 

Ser Ala Ser Thr Leu Asn Gly Lys Leu Thr Leu Ser Glu Thr Lys Val 
20 25 30 

Thr Gly Ala Val Gin Leu Ala Cys Thr Asn Ser Pro Pro Asp He Tyr 
35 40 45 

He Asp Pro Asp Asp Ser Val Ser Val Val Arg Ala Ala His Asp Leu 
50 55 60 

Ala Leu Asp Phe Gly Arg Val Phe Gly Lys Asn Ala Thr Val Arg Phe 
65 70 75 80 

Thr Asn Glu Thr His Pro Thr Ser Met Ala He He Ala Gly Thr He 
85 90 95 

Asp Lys Ser Thr Phe Leu Gin Arg Leu lie Ala Asp His Lys Leu Asp 
100 105 110 

Val Thr Ser He Arg Gly Gin Trp Glu Ser Tyr Ser Ser Ala Leu Val 
115 120 125 

Leu Gly Pro Ala Lys Gly He Gin Asn Ala Leu Val He Ala Gly Ser 
130 135 140 

Asp Arg Arg Gly Ala He Tyr Gly Leu Tyr Asp He Ser Glu Gin He 
145 150 155 160 

Gly Val Ser Pro Leu Phe Trp Trp Thr Asp Val Thr Pro Thr Lys Leu 
165 170 175 

Asp Ala He Tyr Ala Leu Asp Val Gin Lys Val Gin Gly Pro Pro Ser 
180 185 190 

Val Lys Tyr Arg Gly He Phe He Asn Asp Glu Ala Pro Ala Leu His 
195 200 205 

Asn Trp He Leu Ala Asn Tyr Gly Glu Val Glu Asn Gly Asp Pro Ala 
210 215 220 
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Phe lie Ser Arg Phe Tyr Ala His Val Phe Glu Leu lie Leu Arg Leu 
225 23 C 23 5 ^24 0 

Lys Gly Asn Tyr Leu Trp Pro Ala Met Trc Ser Asn Me: Phe Tyr Val 
2^5 250 255 

Asp Asp Thr Asn Asn Gly Pro Leu Ala Asp Tyr Tyr Gly Val Val Met 
260 265 270 

Gly Thr Ser His Thr Gly Met Thr Val Gly Thr Pro Cys Leu Lys Ala 
275 280 285 

His Ala Asp Tyr Glu Lys Glu Pro Met Ala Arg Ala Thr Asn Glu Gin 
290 295 300 

Ser Gin Phe Leu Asn Gly Thr Trp Asp Tro lie Ser Asn Glu Val Asn 
305 310 315 320 

Val Lys Ala Phe Met Arg Glu Gly Val lie Ara Ser Gin His Trp Glu 
325 330 " ' 335 

Thr Ala Tyr Thr Met Gly Met Arg Gly Leu Gly Asp Ala Ala Ser Pro 
340 345 350 

Thr Leu Asn Ala Thr Val Glu Glu Ser He Val Ser Trp Gin Glu Ser 
355 360 365 

Val Leu Ser Asp He Leu Asn Lys Thr Asn Leu Ser Asn Val Val Gin 
370 375 380. 

Pro Phe Val Leu Phe Asp Glu Leu Gly Thr Tyr Tyr Glu Ser Gly Met 
385 390 395 400 

Thr Val Pro Asp Gin Val Thr Leu He. Tyr Pro Asp Asp Asn Ala Gly 
405 410 415 

Asn Met Leu Arg Leu Pro Leu Gin Asn Glu Thr Gly Arg Ser Gly Gly 
420 425 430 

Ala Gly He Tyr Tyr His Phe Asp Met Asn Ala Pro Pro Arg Cys Tyr 
435 440 445 

Lys Trp He Asn Thr Ala Gin Leu He Arg Thr Trp Asp Gin Leu Arg 
450 455 460 

Ala Ala Tyr Ser His Gly Ala Gin Thr Val Trp Val Ala Asn He Gly 
465 470 475 480 

Asp 



<210> 18 
<211> 33000 
<212> DNA 

<213> Aspergillus terreus 
<400> 18 

tggattttct tcctgtaggc ccgtagctat 
aatattagaa actgctcgcg tatcttatcc 
gttttagcaa gctggacggc ctgcagggcg 
gaaagatagc ttatatagct ataaggcttt 
aaaataataa gaaatctaat aagagtactt 
agtaagataa ttaaaaatat taaacaggcc 
tgattaacgc gctacaatag tttaagatct 
cggctataaa tttttatcta attaagcgcg 
agtaaaatta gctggttaqt ccggctacta 



gtaatctagc taaacagagc gcgtatttta 60 

agagcgttag ctaggtaggt tacctggtct 120 

: actaatattt aggctatttt tataagcccg 180 

agaaagatct actgcttaat atctatttct 240 

ttaaagagat ctttcttaag agtatggtcg 300 

taattaagca gttctttagt ttgctgctgc 360 

tagctttaga ttaggagatt aactagctgc 420 

gtaaactagg cagtatttag ctagtggcgg 480 

tggtagqcga agtaaataag acactgctag 540 
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a z cccccc c e 

" I C I C c S ~ C 
cG'CcccCc" t 

Ic£l LCLCCC 

aaagcctcc: 
tctagtc:: t 
tacciga La a 
act t a t a 1 1 1 
taattttaat 
acgcagcta t 

5561355 tea 

get tat taga 
a taataaaac 
1 t a taggt ta 
acctagagac 
atttattaza 
a ttcgtgt ci 
tctttaaata 
ageaeggega 
gtgaact t gg 
a t tggcaa tg 
ct tgaegtec 
cgcatcctgg 
aacttcctcr 
a tegggeca t 
t ceggget eg 
etcegtat te 
geggaegace 
t tgetgegge 
g tcttcg t tg 
cagcgcgacg 
agccagatci 
accctgt at c 
attgetcaag 
acacat cat t 
gccactggtc 
ta taagtgaa 
ccacgcggca 
cgaagtgaac 
aggceggaca 
gaccacata t 
aggegqacaa 
gcaa tecgeg 
cctt tct gec 
ctttccarcc 
ggaa tgagaa 
teegt atgcg 
t taat tgagt 
tt tgggcaga 
cagccccgaa 
gaaatca tgg 
tcggtgagtg 
tacgeaact t 
tgggcect ag 
acaggaga ta 
gtggcaca tt 
a tgtctgeae 
cct eggtact 
ccaaggt tgt 
1 1 teegt cgc 
t tgaat tgat 
tccttct caa 
ggccggtcaa 
a tattgtagt 



actaacccza cc wCcctacc 

CCCCOccGiS Gclc'vLcSlC 

cciccCcCC: c t a a t t a a t t 

lcCCZgLcGg attacaaaac 



cctacataaa z 



:o:a; :ccoc 
: aa t aoaca t 
:aacccoacc 



acttgerega 
czgcccagta 
acctcieoc: 
::cc:a;ccc 
a taac taacc 



c t a t t a a a t a 
ctagtegtta 
c t caa t a a l ; 
aattacaaca accci ccc" a tatattsctt 
agaaccgcig atagctaata atataetctc 
tctaact c 



aaoi: 



::c:lc 
ggca c t 
ca 

taaaoct act 



„ ^ u a a a a , 



act 

tttaatattt atttattcte tgtc 
aactaeggta gactggaaca 
tctcctactt taga t taat t 
gggccggaat agtggtatta gaataaaagc 
tagtagaaga aactatatac tagtaggaag 
agttatagag aaagagctct attgttaatt 
ttaagaataa aggaaaatag gtttctataa 
ttttttaaag atatattaat cgcgcctcgg 
ectattgett attcctgaag ataaaarata 
agacaqaacg gcggagctgt ctgttcgggg 
aatattaaat cttgaccaat tagcaggega 
gectagggae taggcaacca ccaagagaca 
gectgatteg ctcggtccaa gctctgcgcg 
accgcaacgt cacacatgaa aaccccatag 
cacgtatccg categggcag ctggatcgat 
gtcteggteg cgttcgtcct ctcgcctctc 
gcggatagga agaagtaagt gtttgggttg 
caccagcggt ggccgtcaaa tcgcgcctcc 
cgggcggtgt tgatgtcgga aatgctggat 
tegttgaett ecttgegtag gtcgctcgtg 
tcgtagtgtg cgctttcgta ccagaaggtg 
aagaaggtga catagecagg gaegtagagg 
aaggacagca cgacaggaat aaataaggat 
atccaggatc ttttcatacg cagccctaag 
ctgeatcagg ttgegggett tctcttcagt 
cgagtaccac cacattgtta geaegcagtt 
ccctccaatt gtcagggttg ccatggtagt 
gatcttttta ttcagcccaa ccgttgtatc 
atcctcagta tcaaaccact cctggacgag 
actgtcaccg gttgtggttc ccgtacccat 
ataatcgttg acgccgtaca cagtgtcagc 
gctggagtca tccgccgaag aagcagactc 
cttggtatag tacccgccaa gtacgacaag 
ettgetgtea cagatgeett cttgcccgag 
ttgccagtta gcatcgccgg etataeggag 
tctttcgata cagtgtcgat tgaaggtccc 
atagacataa tcgcgggtcc tcctaagqtg 
aaaatatgtg ataccaccca ttctctccat 
ttacttcgtg caaaatcgcc ccggctcgag 
cgcacctcgc gagcactgac ctggaactct 
aattaagccg attcctattg ttcatcatgg 
ggtttttgca agtgatcgtg aacttttcgt 
tgaccatccg gtcgaacgte taccttacat 
atteggcaaa gaagatgtcc ctatatgggt 
ccgaggcagc ateaagegea tattgeattt 
tgcactcgct gttgtcctga taatgagggg 
caggtagatt ggggttagtt ategggcegt 
cgttgtacca cctctgcgct gtgttgttta 
gaccgtcgcc cttcagatca gggaaataag 
tgggaccctg cggtccccat tegtatcgeg 
accacaccga cccatctccc gagaacttgt 
cgttgacgtc ggcccagtgt agattcgccc 
caatgtagtc ccatccatcg tccccattga 
cgcacagata ateggecttg ccgttgccag 
agaaaccaag gcctcggtgc teggegcagt 
cccagtcaaa actccccgtg tetttgatct 



ctacattag ctggttttag 
tgaatttatt aatccggccc 
cgctagcagt aaattacttt 
gggcgactgc ggtttaatct 
ataaggaaat tgtacttaga 
coctagccta atttttttat 
cctttaqaca gacccetttc 

loccCcc l lc c C L c C Z o G 5 l 

ttagtcgege gcgaacccta 
gtgecctatt auaattagta 
aatagatata tttataatat 
gagagctact tagagggggc 
gaagtattac agttaaagca 
tataacgegt ttggaggtag 
gtgtagtata ga tct agate 
ctaggctcta agagaaaagt 
ggagctattt tctttttaat 
atacaegtag taaaaagtaa 
gatataatct cgctaggtcc 
cacccatgta aggecaeggg 
gaaaatggat cgaaggttgg 
tcctggctac tatagtccct 
atcaaactco aegagttege 
ggatccgggt cagatcccag 
ccggagttca taagcgcggt 
gtgtctccaa caatcgegat 
teggggtctg gttcttgggt 
atatcgatgt aatggatctg 
tggattaacg agttgagcat 
agccacacat agttgccgga 
gttgagtege agtcggtggt 
ttgaacgact aaagtaaagt 
aacagggaga tataegctag 
tttcgagccg agtccgtcag 
ctccagacac cactggcggt 
gqaaaccaga tegctgaaga 
ccccgtgggg tcctgtccta 
tccagagcac gcatagttgg 
etttcegtag ctgttacttc 
cccagcggcg taggagtcac' 
atcataagac gaatcatcgt 
ctcaggaagg ggaatctgtc 
ageccatgaa teegcattgt 
gttatttggg tctgggcaaa 
etcattageg gctctttcag 
tgeggcaaga tctcgagctc 
aaagaatcag tateggectg 
tgacaatctc cataacatag 
tgagatgggc at'eatgaaga 
tcaactcatc tgtgaagaac 
ccaaccagcc gttctegtea 
cgtcgaattq cgaagaccca 
ttatagtgaa ggtcaagctg 
cgtatgcgcc qccgctaaag 
cttcaaagag cgtctgcacc 
tgaqctcga t gggggctttt 
catcgcctgt gtggtctttg 
tggagttcca gatgctgtgc 
tgeatgagee ttcaaccgcc 
atcccttgat atcacgttgg 
ctgtccagat catatcggcc 
tgtccttctc cteggagtae 
cccagcccca ggtgcggccq 
aaacatcagc aaaatggacc 
aaagctcatc tgcagcattg 
tgttccgcca aacttgggee 
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crct zcacaz 
cccagctoca 
a t c t c a t t co 
cgat aaaca c 
tcccccarca 

ca tgaacg t a 
agtcctgtcr 
cctgatcccg 
agrtagca t c 
ctttgatgca 
tccagtcgt c 
tgact t tccc 
atccatt teg 
eg tcgccgtc 
.atagtccgat 
tgatget t gc 
eta tea 1 1 ta 
t accgt tga t 
tecccccct t 
tcaccgcgcc 
aaagcctccc 
tcccggcgcg 
ca t cctcacc 
tgtcacaggt 
ccgcagcgt t 
t ggggtgct t 
gagceggegg 
ggaccagctc 
ccgagggga t 
gggt t tcga t 
agtcgttgg t 
ccgcggtt tg 
tttatatgtc 
tgtctacca t 
gaagt ttgtc 
tgatggatgc 
ctcttggacg 
agacagggac 
ccgtaaagaa 
tcca tact gc 
cct eggegta 
gt accaggat 
caagtt tggg 
gggtttat tt 
gaaccagaat 
catcatctgt 
ggtgtttgca 
tagctgagac 
tattggataa 
tatagagcac 
tetgect t tg 
tctaccaaca 
t cagcaccgt 
cagaa tcctc 
tcgaaactcc 
agaacacttc 
tgagcccccc 
atgcttagtc 
catcctacta 
cgcaatcacc 
getgategga 
ccgea tegeg 
ateggtet ta 
actctgggta 
t tgga tgtat 



r.c:cogc5t c 
qa z cocgccc 
ccccccacaa 
ccatatggcc 
z a z r. cccc lc 
cccttccata 
atacagegge 

CCCGaGCt c t 

taa z ccgaa t 
t tggt agaac 
get cegggag 
gcgcccgt eg 
cgtgaagcgc 
ccagca tgtg 
gatatcagca 
a tegq tgaag 
g tagg tgg tc 

tCaCSCatoo 

gt cga taaaa 
cgt aaactcc 
ggagtcct ta 
gaagaaaaag 
gacagtaaac 
act ccct tgc 
ccccgtggat 
gtagatcgcg 
gt tat eggeg 
a tcca ta teg 
gcggaactgc 
cagggt cgac 
cagggagege 
geeggegt tg 
cact tgcg 1 1 
aat tggtctg 
gtcaccgt tg 
aeggagaggt 
gecaagaggg 
ccattcagca 
ggctgctcct 
aagtagagac 
gagtcget tt 
tgtgccgt tc 
aaacctgga t 
ggccataa tc 
ggcgtagt 1 1 
agtaccaaag 
gacagggaga 
actacgegga 
atggcactag 
gt tatt taca 
tatggacact 
atctctatca 
ct tcct aact 
t ccccgaaa t 
a tea tcctaa 
acgcacggca 
acca tcctat 
acgaccggcg 
ctcttctgcc 
accata taca 
actcccgagg 
ctctttcttt 
ggat tcatag 
agectgacag 
ctcagggt tc 
tggtaccatg 



cczccacaz z 
Cicaacc: z z 
cc:ccac:c: 
get tcaa zee 
cctGtctccc 
cccti.ga!!c*_ 
eeaaaa tact 
c ca a a a tccc 
ctcccgaa:a 
ccct. ggcccc 
t tggtgtaug 
ccgt tgata t 
1 t gcccagcc 
acgtcgccg: 
aggcg taca z 
ca tggaggct 
ccgt ccaacc 
a tcagt ccca 
tgcacacccc 
tgctcctcag 
tct t tgtgga 
tqcagctcgt 
aacgccccgc 
tgcgtgaacc 
gagatgtcaa 
1 1 a taccaga 
aagt ta tccc 
gecaggaega 
gcg 1 1 gaege 
aggacga tga 
a tgcgctccc 
a t cagcacga 
atcacgtcgc 
tctatctgtc 
gaettactge 
ttgegatat c 
aggatccgt a 
gcgtcaaagt 
gtggcagtag 
gecat tatcg 
tatactttca 
tgaaacaagt 
act taggta t 
ct g tggcaga 
aeggagcaac 
gatgagactg 
tatgcagtga 
gtgctaggga 
gtagaggcta 
atctccgcat 
ccgacgt tea 
agctggccct 
ggataaqga t 
acgagtcta t 
ccacagcctc 
tct oca c tec 
tegcat tget 
acgggttgaa 
aagcaatcca 
ttcaatacct 
aaacgagaaa 
ccgcgcacgc 
ateagtcgeg 
gcctggcgtc 
ageagectgt 
taa t tgggt g 



a t a t cccacc 
a zeecaaz cz 
tccccggcca 
azccaaazQZ 
z cgcccataa 
•cacccccicc 
i gt cagagg t 
ccaccccgcc 
t tact ccca z 
acacgacc: t 
1 1 gtcg tagc 
cctcgaat eg 
getgecag ca 
tgtcagccaa 
gtgcctga tc 
tgttgctct t 
cga tgcaga t 
gatggt tcct 
ccgggttaca 
t gtggaca 1 1 
agatcatca t 
cgtcatcctt 
tatactcget 
cgccagcgta 
ggtccgccgg 
tgt ctgecat 
ggtacg tgat 
tggagaca 1 1 
agggectgt t 
gcgtgttggc 
eggegt tege 
cgttcggct t 
cgctgtgggc 
ggaggggtgg 
ccacca tgtc 
catt t ccggt 
gagggact tt 
ctata tatct 
a tgatagaag 
a tgetgaetc 
cagccctcca 
gtacttccaa 
caaagcatga 
ggttcata tg 
caa tcactgg 
gcatcct tea 
ttgagtggtg 
ggeggctgat 
ggacctgcgt 
taggeggcaa 
tttaatatct 
tctggcat tt 
gtgcgtgcat 
accta t taaa 
ct tctacccg 
ctaca tcct c 
ggtaaacagc 
cctgtaccag 
tagcctccac 
atgeatttet 
acaaaggecg 
gatgacegtc 
act gatctcg 
teaggcegtg 
tcctcgaa tg 
gceggt tg tc 



ciccgtcgcc 
Gctcttgaac 

Ct C CocCCSC 

aaicca teat 
caatt acca a 
tccgccc t z c 
atcct tct tc 
a i aaactccc 
t ccggga t gc 
caacccgtca 
gccatcgtca 
cacgccgcgc 
tgcgggga t a 
acegcagtag 
gtatccttcg 
ggcgtcgccg 
g tag tea tea 
eggcagacag 
attattatga 
at tcgtgtaa 
gtca tea tac 
gtatggatcg 
gtcgtgtcga 
gactccgctc 
cttgacaatg 
ctgggagtac 
ccagt tgt r t 
ctgcgcctcg 
agct tcgagg 
ca tgtccggg 
agggtcgacg 
gtaggegage 
ttctaca tec 
gggaagagag 
cacctcccag 
tgagga tagg 
cagggca tea 
egaategtet 
agccagtgt t 
tctgcacct t 
gtcat ttca t 
gaatcgegag 
gaccctggca 
eggeggtega 
cctgaatatg 
tagecataac 
caaeggget t 
tggcatga ta 
aagcattca t 
tccttaata t 
ccacccgtgt 
atecttgea t 
ttgtccct ta 
agtatccagg 
cagatceggt 
t t caacctaa 
ggcggagatg 
ct tttcgcag 
gccaatccac 
atct taccag 
ccaacgggcg 
ctgccacta t 
cggcgcagac 
gtct t tgetc 
ccga tgagaa 
gaegatgegg 



ct cccag z ct 
geccaagara 
ccrcctattc 
acca:igtcg 
tcaarctctc 
etc ~ " ccaca 
ar.caacacat 
ccaaagtaga 
gacccgccac 
ccacact ct c 
t caacccaca 
aggtctccca 
tct tcga tec 
tcgccgcggc 
ggactcttcc 
tctccattgt 
agt ccgt cgc 
acacag tget 



gtcgagaacg 
gaaacaaacc 
gccccagta t 
get gcaccct 
tagat teegt 
ccgract ct t 
agctccgcca 
ccgtagtcgt 
ccggggctgg 
cgea tgt caa 
gt tgtggaac 
gegecgatta 
ttgtagtcgc 
gagt ttgegg 
1 1 1 t cccgcg 
atgea tacat 
cct tcaaagc 
ta t ccccagg 
t ctegt t ttg 
gtgggtcgcg 
agattgaaca 
tcactcctct 
gtgtaagatg 
ctcgagtatg 
ttt ttcatgg 
gtca tgcaa t 
gtaccagga t 
tgtTigggggt 
aaatgtaa tc 
agcaaaggct 
tccgatgctc 
agaatactag 
tacccctct t 
taaaca tgac 
cacarctcca 
ctaa ttcaca 
gcatccaact 
tcagcgcaac 
tcctcatcca 
tgtggatggg 
gccgcaaact 
aggtcatcga 
agaggaact g 
cggccgtgct 
gggagcagee 
tagt z tct gg 
gacctg tgga 
t t tangcgct 
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cggacaaicc ctt rcct::: criatgccc: :rcr:cccct -ctcccocco a rcctaggga 8520 
z oaaqcacz c cazcczccco agaczgazga cc:gr:acca gagcacgaac acca: cgc z a 8 58 0 
cggcggaacc ccgaczzczz agatrctctc :a:£cca::c ccccaccaia tcgcgtgtaa 86^0 
caccazgczc naaaacc:ac acicarcccc atggatttct zoacaaccz z gczazaazag 8700 
ctcttggtca actctzzzqc c;te:t::a: t c ngcca :ac gctcicaicc agaccacaat 87 6 0 
ctgcagggga aagagaacac c:araccc:c ccacgatctc aacczzccaa aczczzazza 862C 
tcaaactggt aatagcaccg l gaa cggga t cgcttaggcc ;iaga:caac gact:agagc 8880 
ctggctgcaa gccctaaaca cctgrciac; ccacgr.:ai;a ac: t agcaac t ctcccgagc 8940 
tctcgcaagc ctaccactag cttatagtca atagattgtc gtacatccaq gggcctttta 9000 
agtatttacg gctcatacaa aaagtaatat gagacagcrc tactcc t tt t acatcatcta 9060 
catctatcgc ctagattggt agctcgcagt gggtcatccc tatctgttac caraagactt 9120 
cccgtggtcg caattcaacc taatgcagga ctttctcgtg ggccgattta ttactagacc 9180 
tattgatctt tgattgtggc trcatccggt cgctattact tacagccaca gtgatcatag 9240 
tattcgaatt acaatgttga accrctgcca cgatcactat caggtccttc tagttggacc 9300 
tctctccacc agcgttgaag tataatcccg tagactcaaa aaccaacccg acctgatcca 9360 
ataccaagtc taattataat cagaaaaaaa tatc ttaaaa attaaaattt atatagatat 9420 
ctgctaatcc tcgcagacta tatataagaa acccgtaatc tgcgcggact atagtagtat 9480 
agattttagg acccgctccg cgcggagcgg agcggactgt ggatctgcgg agagttagta 9540 
tacaaaatac tagaggcggc gaggtcgagt tagcgcagtc tagtactaaa taggaaagcg 9600 
gttattttag cgcgaaaaca gggtactaat atttttggac ccccactgta ccttcacgga 9660 
tagtaccaca caactacatt acgattatat atcatctttc cccctcccat ggt t ttccca 9720 
agcaccttac acgtggccgc cctcaactac ctagtttgtt gacagaagtg cagtaacagt 9780 
gacctaactg agtgtagttt agagatatat attctactat agggggctaa accgtggtgg 9840 
attacgcaga ctttttgcta aaagcataat agctacgata ggtagaatat aactatagat 9900 
aaggtagatt agggtaggta cgactgcggc ggcttgatta ttccttgaaa atgacgggct 9960 
cttctggggt agactactgc catagtgaac caggaaggcg gatatcctcg tcgcgccggc 10020 
gcatcatgag cttcgtggtg ggatgaatgc gcacgctgaa agaatggcgg tagtactgca 10080 
acggttcgtc ggggaccacc ctccagtcgt atcgtatgag caagtaggct aacatcatct 10140 
tgatctcctt agaggcaaag aaccggccgg ggcaagcccg tggatgccaa ccgaagccga 10200 
tgtggtcccc gttggtgttc tccagttggg cactgaacgc tttagccggg tcctctcgca 10260 
ggcgcatata ccggtaagga tcgtattt tg ccggctctgg ccagacctcg gggttgctca 10320 
tgcggtcggc agctaccgcc accagctctc cttttgggat aaaggttcca ttggagaaag 10380 
tcacatcctg caatgcatag ctgcgcatgg tggcacattc gacgggcttg acgcgctgtg 10440 
actccttgag acaactatcc agcagtttga gcttgtacag cgaggcaggt gtccaacccc 10500 
cttggccgat gaccgtccgg atctcatcac ggaggggctc aaggagatgg ggatgtcgga 10560 
cgatgtccac caacccaccg atcagcaggt cggaggttcc gtagatacca gcaaagtcca 10620 
tggccagttg cgccccggct gcatcgtacc a tt tcccct t -ggcagta tec tcgaaccact 10680 
ggatcgagtc taegtagega ggcggctcaa tgccctttgc ccggcaggca tctctttccg 10740 
cacgtcgctc ctggataatg ggatcgagaa gttgeeggge tcgtcgaacc tgcgcccgga 10800 
gtttggctcc ctggggctcg agccaatgta caagggggcg cagaatgacg ggecagagge 10860 
gcagctggcg cgcttgaatt gccatcgtca ccgcgtggtg cttggcgatg tcaagccact 10920 
cctcattatg ggctaacttg ctcccgacca taataaaagt cactgttcgg gtcactaagt 10980 
ccagacactg gttatagaca ggcactgtgt gecattctga aagcaaaagc acatccctct 11040 
tcatcagcta atggcctaaa aacaaaaata ctcatcatga tcacttacca ttgctgtcgc 11100 
caaaaatatc cgtgataatc ccgctggctt cattggcaag aggcttgacg tacttgggag 11160 
cttgggtctg gaactggttc ataaccacct tggtgatgag atgtgcatcc ctegtgaett 11220 
ccttgaatcc atcgaatccg ggaagatgag agtgaaagtc ctgatgagag cacgaagatc 11280 
agtaagtcag gtcctcacag eggaagcagt tgcaaagaac ggtggactcc ttaccgtgcc 11340 
caagaacttg tacatacaga gctctttcat ettgegaaac tcatcggcca tagaggaggg 11400 
aagaatggtg cagtacccag agtcgactat gaaccgaatg ggcttatcat tttgcgagaa 11460 
ccagctctca atccatgacg gtgeattege atcaaaatcc cgtttggccc tcatggtcgt 11520 
cagttcccac catgttttcg gattgaacac eggcagatea gatctccggc cactcgagca 11580 
caggtaaaga agaaggcata gtagccccgc actggtagtg accaagggcg caaaccacga 11640 
gccatgttgc tgcgtgtcat tccaagccag cgacagaagg eggtgegget gtgtgagcgc 11700 
gtcgacagtc atggctagga gaccaggtgt ggttgaggga taagatatcg agagtgatgt 11760 
gagcaaaaga teegggaaag gtcgegaagg aaagggcgtc tctcttacca agaaagtctg 11820 
ttccctatca tgcaatcacc gcttgctgta cggtggtgat gatgctggga tggtggtggg 11880 
tccccaccga ataaegcegg acagctgttg aagccgaatg acgccggcag gecaaaagaa 11940 
ccctaccttc acttactcaa tcggcgcttc ccctcctatc accaaategg atgtaaatgg 12000 
aegggectta atagegaccg gccgggccgg gaatccccaa aegtagatag ataggcatag 12060 
acccgaaatc tttggcccgg catacatgag cacaggaaqt ttcacgcqac ggcgcctttc 12120 
ctgcctcagc ttcaatccaa gctcacgagt tctgtcgcct ctaccagtcc tgcaa:tgtc 12180 
etactgeaaa cagcatggct caatctatgt atcctaatga gcctattgtc qtggtcggca 12240 
gtggttgtcg cttccctggt gacgccaaca caccctccaa gctctgggag ctaccccagc 12300 
atcctcgcga tgtgcagagt cgaatcccca aagaacgatt tgaegtcgae acattttatc 12360 
acccggacgg gaagcaccac -gggcqaacaa atgcacccta cgcctargtt ctccaagacg 12420 
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anctgcccco cr:ccacccc gcczzczzca aza rccaccc 1098059000 gagactatgc 12480 
acccccacca c:ccc:c::c ::ccac5ccc :::acc6ccc couaacgaat cc:ccaa:cc 1 2 5 4 C 
ctaeccacca zotocaooga aczzccac^c czczzzacqz cggggtcato acccacgac: 126GC 
atgaaaciOc zzcaacccgc oaocrgcaca gcatccccac c:actcggcc aocggtgucc 12660 
cggtcagtgt :ccc:ccaac ccca tercet at:tt:i;aa ctcccatoca ccaaatcraa I272C 
gtcacccaa: azcczgzacc aguctaaiica cgctctaacc gaeegggarc ct l caaaoa t 127S0 
gacgatcget a cocoa t gea oczcgzcczz qgzzqccgtz catctggcqc zgcaacaocz 12840 
aeggaegggt caaagctcoa tggcaactgc tgegggrgee aatctga::c zccccccca z 1290C 
gacaircgtc etigaaacca aa:tgaccai gczazccccc tcgcgccgat cccgcatatc 12960 
ggacgccgga gccgacccct atgecagagg cgtgagi:ctL tcttgagctc gnagatgaca 13020 
gttcccaccg ctgaccgrga tcaggaagct gtttgocccg tagtgttgaa gacattgagt 13080 
caagccttgc gcgacgggga cacgattgaa tgtgtcaccc gagaaactgc ggcgaatcaa 13140 
gaeggecgaa egaceggaat tacgatgecg aaccatagtg ctcaggaggc actcatcaag 13200 
gctacctacc cccagqczcg ccttcacerc accaaggccg aggacaggtg ccaattcttc 13260 
gaggctcarc gtcagcaaag agaacctgtt ctgttggcgc cctgcagctg acattegtat 13320 
ganagggacc agrsccccgc ccggagatcc ecaggaggeg gaggecattg caacagcctt 13380 
ctccggccac gagcaggLag cacgcagcga eggaaacgag agggcccctc tattcgtgga 13440 
cagtgcgaaa accgttgccg ggcacaccga gggcacggcc ggtctggctg gzczcazaaa 135C0 
ggcgtcgttc gctgtccgcc atggggtaat cccccccaac ctget gt teg acaaaatcag 13560 
cccgcgagtc gccccattct ataaaaacct gaggattccg acagaagcta cccaatggcc 13620 
agctctccca cccggacaac cgcgccgcgc cagtgtcaac tcctttggta agegagaatt 13680 
geceggagga accctcacaa gtactcgaat taatgetaac tgaaccgcgc cgatggacag 13740 
gatteggegg cacgaatgeg catgecatta ttgaggaata catggageca gagcaaaacc 13800 
agetgegagt cccgaataac gaggactgee cacccatgac cggtgtcctg agtttaccct 13860 
tagtcctctc ggegaagtec cagcgctcct taaagataat gatggaggag stgctgcaat 13920 
tccttcagtc ccacccccag atacacttcc acgaccrcac ctggtcctta ctgcgcaagc 13980 
ggtcagttct accctcccgc egggctateg tcggccacag tcatgaaacc atccgccggg 14040 
ctttggagga tgecatcgag gatggtattg tgtcgagega cttcactacg gaggtcagag 14100 
gccagccatc ggtgttggga atcttcaccg ggcagggggc gcagtggccg gggatgttaa 14160 
agaatctgat agaggcatcg ccatatgtgc ggaacatagt gagggagctg gacgactccc 14220 
tgeagagett geeggaaaaa taccggccct cgtggacgct actggaccag ttcatgetag 14280 
aaggagaggc ctccaacgtc caatatgeta ctttctccca gecattatge tgcgcggtgc 14340 
aaattgtcct ggcccgtctc ettgaagecg cgagaatacg attcacgact attcttgqac 14400 
atagctcegg egaaattget tgcgcctttg ctgccgggct catcagtacc tegttgaega 14460 
ttcggattgc ttacttacgt ggagtegtet eggcaggggg cgccagaggc acaccqoqao 14520 
ccatgttggc cgccgggacg tcctttgagg aagcacaaga gatctgegag ttggatgcct 14580 
ttgagggccg catctgegtg gctgccagca attccccaga cagtgtaact ttctctggcg 14640 
acgcgaacgc aattgatcac ctgaagggca ngttggagga tgagtccact tttgegagae 14700 
tgctcaaggt egatacageg taccactcgc ateatatget tccatgtgca gacccatata 14760 
tgcaagccct agaagagtgt ggttgtgctg ttgccgatgc aggt tcccca -<i'ccggaagtg 14820 
taccctggta ttcgtccgtg gaegecgaga acaggcaaat ggcagcaaga gacgtgaccg 14880 
ccaagtactg gaaagataac ttagtatctc eggtgetatt ctcccacgca gtgcagcggg 14940 
cagtcgtcac geacaaggeg ctggatatcg ggattgaagt gggctgtcac ccagctctca 15000 
agageccatg cgtcgccacc atcaaggatg tcctatctgg ggttgacct.g gcatatacag 15060 
gttgcttgga gcgaggaaag aatgatctcg attcattctc tcgagcacta ocatatctct 15120 
gggaaaggtt tggtgcctcc agtttcgatg eggacgagtt catgegtgea gtcgcgcctg 15180 
atcggccctg tatgagtgtg tcgaagctcc taccggccta tccatgggac cgctctcgtc 15240 
gctactgggt ggaatcccga gcaactcgcc accatcctcg agggeccaag ccccatcttc 15300 
tattaggaaa gctctccgaa tacagcactc egctaagett ccagtggctg aatttcgtgc 15360 
gcccacgaga cattgaatgg cttgatggac atgeattgea aggecagact gtcttccctg 15420 
cggccggcta tategtcatg gcaatggaag cagecttaat gattgetgge acccacgcaa 15480 
agcaggtcaa gttactggag atcttggata tgagcattga caaggeggtg atatttgacg 15540 
acgaagacag cttggttgag ctcaacctga cagctgacgt gtctcgcaac qccggcgaag 15600 
caggttcaat gaccataagc ttcaagatcg attcctgtct atcgaaggag ggtaacctat 15660 
ccctatcagc caagggecaa ctggccctaa cgatagaaga tqtcaatccc aggacgactt 15720 
ccgctagcga ccagcaccat cttcccccgc cagaagagga acatcctcat atgaaccgtg 15780 
tcaacatcaa tgetttctae cacgagctgg ggttgatggg gtacaactac agtaagaact 15840 
tccggcgtct ccataacatg caacgagcag arcttcgagc cagcggcacc rtaaacttca 15900 
ttcctctgat ggacgagggt aatggctgtc ctctcctgct gcatcctgca tcattgaaca 15960 
tcgccttcca gactgtcatc ggegcatact cctccccagg tgatcggcgt czacqczatc 16020 
tgtatgtacc cactcacgtt gategcatea cacttgtccc atccctttgc cnggcaacgg 16080 
ctgagtcegg argegagaag gttgecttea atactatcaa tacqtacgac aaoocaoacz 16140 
acttgagegg tgacattgtg gtgcttgacg eggagcagae caccctgttc caqgttaaaa 16200 
atattacttt taagcccttt ccacccccgg argcttcaac tgaccatgcg atgtttgccc 16260 
gatggagctg ggguccgrtg actccggact cgctgctgga taacceggag tattgggeca 16320 
ccgcgcagga caaggaggcg attcctatta tegaaegcat cgtctacttc tatatccgat 16380 
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ccz zccz cac zcaczZ zacc ctgcaccccc 

cCc lCOcCI O C C t C C 5 £ C c S CICCHOCCCc 
cCCCCCGCtC CGcCcc UCct cCUGcCCCCC 

a ccaccc z ca net tcccc^o ccucagcgag 
caaaccocaa cccatt caa c ct ictaoacc 
acacactcac cucccaccc ocaccacac: 
atcgctaica gtcaatgqat artctggaga 
acgtgttggc caccccccac ccggccicea 
gactctccca gcaagcgcgc oagcaacteg 
ccctcgata: ccgccgcagt cccgccgagc 
tcattgcctc caatgtgcta catgcgacac 
actctctgct caagcctgga ggccagatgg 
cacggctcgg gtrtatcttt ggtctgttcg 
getgeactga gccgtttgtc tcgtccgacc 
tttccggtgt ggacagtege accacggatc 
ttagtaccca ngcaattgac gccaccgtgg 
ccaccgtcaa ggactcttac cctcccttgg 
agcgtictccc caacga ca ca aaagegatea 
gcctcgtgga tttgetagae geggaggage 
cggagctgga cgaggaatta ctcgccgggc 
tgctgctcac gtacgccagc aataeggtet 
ctcaccaggc gagcacgatc ggcatgetae 
gagttcatgt tctggacgtc gacgcggttg 
aggtgetteg gcttgaggag cataeggatg 
aacccgaggt ctcctggtgt aaaggccgcc 
ctegcaataa ccgaacgaac tcctcgcgcc 
gggctcccgt ggcattacag aeggcteggg 
aaacctggtt tgtgcctgag agtgttcagc 
ttagctgtcc ccatgcgctt agggteggae 
aegtccagga gggcaatege gaagtgcccg 
ttgtgcacgt tcgtcccgat tatatatata 
gtggcagcct tatggtaacc gtcctcgccg 
ccgccaagtg cctgggggta actgactcaa 
ggcagatgtt gctccatgct ggtgaagaga 
ctggcaacag gagttcggtt tetgetggag 
gcgacacgga ctggcacctg egaegggtae 
tatcagcega ccagagctgt gaaggtttga 
gctgtgccca ttaccgtgcg gcagacctgt 
gcggatcgcg gcatcaagct tcactgcccg 
cccgccaggg actccctagt gtcagegagg 
cagcgcatgc cgacaagacg cgcccggatc 
eggacgagge cacgcttcct accagggttc 
ccgacaaaac atatctcctg gtcgqactga 
ggatggtcca geatggggee tgccacattg 
ccaagtggct ggcgcatgtt gaagaactgg 
aggagtcctt ccttctgcaa ttcctcctta 
acgtgacaag ccaaaactca gtggaagctg 
caccagtggg gggtattgee tttggccctc 
tggaactgee aatgatggag atggtgctca 
acgagaagtt . ctccga tccg accagtagca 
cgattgtggc cgtcatgggc aacccgggtc 
ttcaageget ggcgcagcag egagttgeat 
tttgetaaac actcctatgg gectttacta 
tqccgtqtac gqcattgggt tcgtcactcg 
tcggttcatg ttcgattegg ttgaggaaca 
ggtggccggt cgacgagccg tgcaccagca 
cgacatggct qatctggaac tgacaacegg 
tcggatcacc ttcttcgacg acccccgcat 
ggecaaagea ggcgaagggg cagccggctc 
ggegacgaac ctggaccagg teegtcagat 
atattctccc cttcctcact cageggactg 
tggactctcc gcgaagctgc aggtgaccct 
caccatccca ctaatcgatc agggggtgga 
gttctccaag cagctgtacc ttgatttgee 
caccgatctc gctaatgagq ctgctgcgcg 
agccaccgac gggggtgcag agagcactga 
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cccaccaccc accc*;tccat ttgeagaage 164 4 C 
ccGCcaacca cccicgccac ciatcgtaca 165 Oi 
acactcccca cc zzzczacz cctaac zccz 1656T 
tcggccaaca cczcczcccc acccnacaat 1 6 62 C 
acgatcccct cctcacgcac zzczazacca 166£I 
acacccci'ca a::coi; occc cacazcoccc ~ 6- ~ 4 1 
ttggaccacc gaecggccgc cczaccaacz 16S0C 
acscczazac aiacacccai a zcz ccaccc 16861 
ccccczzcca c cac eg gar a grcttcgaac 16 92 C 
agggcttcga gccgcatgcc tatcarctga 1696C 
ccgacctaga gaaaaccatg gctcacgccc 17040 
ttattctgga gattacccac aaagaacaca I710C 
ccgaczccz g ggctggggtg gatgatggtc 17160 
gctgggangc gatcctaaag cgtgtcgggt 17220 
gggacgcaaa tctattcccg acctctgtgt I728C 
agtacttaga cgcgccgctt cccagcagcg 17340 
tggtgg-Laag agggcagacc ccccaaictc 1"?400 
zgcczccz eg tccgccccag aca:acaagc 17461 
tgccgacgaa gtccacgtt: gtcargctca 17520 
ccactgaaga gacctucgag gcaaccaagc 1 ~ 58 C 
ggexgacaga aaacgcctgg grccaacatc 17640 
gctccatccg cegggagcat cctgacttgg 17700 
aaaccttcga tgcaaccttc ctggttgaac 17760 
agctggccag ttcaactaca iggactcaag 17820 
cgcgga zzee tegtctgaag cgcganctgg 1788C 
qzeccazaza cgagatgatc gattegtege 17S4C 
atrcaccacc ctacttcttg gagucegctg 18000 
agatggaaac aaagacgatc tatgtccact 18060 
ageteggget tttctatctt gtgcagggtc 18120 
zcq tggee: z ageagagegt aacgcatcca 1616C 
ctgaggcaga taacaatctg tctgagggtg 18240 
cggcggtgtt ggcggagacg gtgaccagta 18300 
tcctcgt:ct gaatcccccc agcatatgtg' 16360 
teggtet tea agttcatctg gccaccactt 13420 
acgccaactc ctggctaaca ttgeatgetc 1848C 
tgccccgggg tgtccagqct ttagtcgact 18540 
ctcagaggat gatgaaagtt ctgatgcctg 1S60C 
tcacagacac cgtttccact gaattgeata 18660 
ccgcatattg ggagcatgtg gtatccttag 1872C 
ggtgggacgt gatgccgtgc actcaatttg 18780 
tctcgacagt tatttcctgg ccccgggagt 1884C 
gcicca c:ga cgctgagacc etctttgegg 18900 
ctggagatct tggacgatca etaggtegtt 18960 
tacttacgag cagaaatccg caggcgaacc 19020 
gtggtcgagt cactgttctt :ccatgtaag 19060 
tgatccccac taaegcaget ggcttcaggg 19140 
gcctggctaa actcaaggat ctgcatctgc 19200 
tggttctcca ggatgtgatg ctaaataata 19260 
accccaaggt egaaggegtc cgcatcctgc 19320 
accctctcga ettcttegtg atgttctcct 19380 
aggctaacta cagtgegget aactgctacc 19440 
ceggattage agtacgtttt cactccatcc 19500 
aacegggcag gcgtccacca tegacategg 19560 
ggeggagerg gaggaggact ttaatgcaat 19620 
tgaactgeat acactgtttg ctgaggcagt 19680 
agagcagcag eggaagtteg egacagtget 19740 
aattccgccc ctggatccag cccicaaaga 19600 
aggcaactta aaaattcegg agtaccgagg 19860 
caagggctcg gtcaaagaac agetcttgea 19920 
cgtcatcggt aagttgagcg satcegggga 19980 
gagattaacc gcttctt ttc ctttggcaga 20040 
gcagatcccc gatggggaaa gcgtgcatcc 20100 
ctctcugcgc gcggccaccg tgggaacctg 20160 
acrcccgaaa gtgcttgggg gtgettcgat 2C22G 
attgccacct agctccattc ccctcgtcgc 20280 
caazaczzcc gagaatgaag tttcgggacg 20340 
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caaoG£ tac" qacczzaczc cccccccca: cazcaczcac czczcgzczc ccz a z zaaca 2C4CC 
cga tacgcac ' ccgggccacc aggacczccc gccttcccac Cctccaccc: ct;;:cccca 2046C 
a caa Caen cc nocagaatcc agcacgcacc cgaagacccc accczczzza acaazaccaz 2 C 5 2 C 
cggtatgtLC atgaagggcu ccauoaccc naaacggccg tacaagqcgt tgaga gcggi 20580 
ctngcgcccc cacgagatct tccgcacggg gtrtcccaac grggacgaga acgcgatggc 20640 
ccagctggtg tttggtcaaa ccaaaaacaa agtccagacc arccaactgr ctcaccqagc 20700 
cggcgccgaa gagggctacc gacaacrggc gcagacaccg rataaccctg ccgcaggaga 20760 
caccttgcgg ctggtggaci tc^tctgggg ccaggacgac catctgctgg ttc:gactta 20820 
ccaccgactc gtcggggatg gatctactac agagaacatc ttcctcgaac cgggccagct 20880 
ctacgacggc acgtcgccaa gcccacatgt ccctcactti ccggacctgg cgccacggca 20940 
acgcgcaatc ctcgaggatg ggagaatgga ggaggatctc gcgtactgga agaaaatgca 21000 
ttaccgaccg tcctcaatzc cagtgccccc actgatgcgg cccctggtag gtaacsatag 2106C 
caggtccgat actccaaaie zccaccaczc zggaccczgg caccagcacg aac::::goc 2 II 20 
gcgacttgat ccgatggtgc ccrtccgcat caaggagcgc agccgcaagc acaacqcaac 21180 
cccgatqcag ttctatctgg cggcgtacca ggtgctgttg gcgcgccxca cccazagcac 21240 
.cgatczcacc gtgggcctcg ccgacaccaa ccgcgcgect c:cgacgaga tcgcccccat 21300 
ggggttcttc gccaaccicc ttcccctgcg ct tccggga t ticcgccccc atacaacatt 21360 
tggcgagcac cttatcgcca cccgtgacct ggtgcgtgag gccttgcagc acgcccgcgt 21420 
gccctacggc gtcctcctcg atcaactggg gctggaggtc ccggtcccga ccagcaatca 21480 
acctgcgcct ttgttccagg ccgtctccga ttacaagcag ggccaggcgg aaagtzggaac 21540 
gattgggggt gccaagataa ccgaggtgat tgccacgcgc gagcgcaccc cttacgatgt 21600 
cgtgctggag atgtcggatg atcccaccaa ggatccgctg ctcacggcca agttacagag 21660 
ttcccgctac gaggctcacc accctcaagc cttcttggag agctacatgt cccitctctc 21720 
tatgttctcg atgaatcccg ccctgaagct ggcatgatgg cgcaaacata aaacataata 21780 
gcgcagcagg gacgatgtag atagagcttt gcttctgcgg gtggatctat aata:agtat 21840 
atataaatat ggtgagccga acgaagaggg gggaatgcca caattattta etc::: zgcg 21900 
ccgtacacga ggagaagacg tccagaacaa cataaatata tcactctagt gaaacaccat 21960 
atatteggag agactataaa aatatacatc tactccaatg tcngggccgt cacacacagc 22020 
ttacgaaaac gattaatgac ctccaacacg tcgcgcggtc gattgggaaa ctaa:actgc 22080 
ccagcaaact ccaatacctg cgcctctcgg ggggagaaat ggcgcgccac cagcatcttc 22140 
gatcctgega gegcaaaate atcgcgaccc tgcagatgta atgtcggtat ccgaaraacc 22200 
agttcctcct gccactcggt atetttgetg tcgttgtcgt cgtcatggtt cttcatcact 22260 
tcgttcctca tatactggct tgcctcgtct tgataccagg gacagatcaa cagcgcaaca 22320 
ctcatccggg gcaaccaggg caggtgaccc atetgetget gecagaggag caaggtegtc 22380 
accagggcac etteggagaa accgatagca cccacgatag ggatgtgggg gtgttaagtc 22440 
tgecagtega caatggtgcg gcggatgggg tcgtggacgg eggegaggeg ttccctcacg 22500 
gagggtccat tatgattgtt gtcgctgctg ctttcaaacc aggagtaata tggecctagg 22560 
teggegaaga eggggagaat cccaggccct gcagaggaag ggaaeggage tgtcaegtag 22620 
acgaattcaa agttttcgcg cagcgcagcc eggagcttag acaattgeae gcgcaagatg 22680 
gegggagage agccagcccc atggatgcaa agaagegetc gaggegcett caccaacact 22-740 
ggagatgett ggtaaegcat ggaggagggt acaatgggac tatatcctgg atgeaagaca 22800 
gggatgaggg agtgtcgagc ttacacgttc accagegatg aacegctatt attccaaegg 22860 
aatatcctgt ctaacactct gcatctactc caggtggacg ggacaageca gega^acega 22920 
ttacatccat aggaacagca gtttgttcgg aacttccgct ccatcccctc gata:cgggc 22980 
geaggatgee ggcgccgaat aacgccgcac aaacccgaac ggatctgcag gtga^cccga 23040 
agccctaatc caaggatcgt ccgtccttct gtctatqtct tcccgcatat gtagcccgca 23100 
gegtaccaga tacgtcactc aacagttaac cagagaagac gaeegtgaca gaergecatg 23160 
ggcgaccagc cattcattcc accaccgcag caaacagcgc tgacggtaaa tgaccatgat 23220 
gaagtcaccg tctggaatgc cgcaccctgc cccatgctgc cccgcgacca ggtaiacgtc 23280 
cgegtcgagg ccgtggcgat caatcccagt gacacgaaga tgegeggaca gtttgccacg 23340 
ccctgggcgt ttcteggaac ggactatgee ggcaeggteg tcgcagtggg ttcqaacgtg 23400 
actcatatcc aagtgggtga ccgggtctac ggggcacaga acgagatgtg cccscgcacc 23460 
ceggatcagg gggcattctc gcagtacacg gtcacgegag gccgtgtttg ggecaagate 23520 
cccaagggct tgtcgttcga gcaggctgcc gcgctacctg egggcatcag tacccctgga 23580 
ttggcgatga agttgcttgg getgectttg ccatcgcctt cggcagacca gccacccacc 23640 
cactccaagc cggtgtatgt gttggtctat gggggcagta cggccactgc cac:grcact 23700 
atgcaaatgc tccgcctgta atgcttccct tgtcctgaga cttttctctc cgttggtcgt 23760 
gggctgtaca agcgatggtt atactaagat ccgctggcag gtceggatat attccaattg 23820 
caacatgctc cccccacaat ttcgacctgg ccaaatcgcg eggegcagag qagcrctttg 23880 
actateggge cccgaatctc gegcagaega tcgtcagtga acccctgcca ccgctictacc 23940 
cctcccagtc cactttggcc ttacagaaca gactattgat ancttctag cgcacccaca 24000 
ccaagaacaa tctccgctat gctctcgact gtatcaccaa cgrcgagtcc accacattct 24060 
gcttcgcagc catcggccgc geggggggge actacgtctc cctgaacccg ttccctgaac 24120 
acgcggccac gcgcaagatg gtcacgaccg actggaccct ggggccgacc atctciggcg 24180 
agggatcaac ctggcccgcc ccctatgggc gtcccggcag tgaggaagag eggcaatteg 24240 
gegaggatet gtggcgcatc geggggcage tegtcgaaga tggacgcctc gtccarcatc 24300 
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ccttacccgt 

agggagaact 

ccgtracgtc 

aaacgtacat 

gg 1 1 1 1 aaga 

ttgctct ctg 

aca tccacgr 

tatatgt tag 

ta t ca t cccc 

tgctcgaa tg 

aacgcaaggg 

ggctgga tea 

acaatgt ttg 

agatcctcca 

ggcccaccgt 

cgegget cga 

tetcget t ca 

tggccgccga 

cgcccatccg 

tgetgeaget 

tactget cca 

gcgccgt aga 

a tgecaaact 

tggaggaaga 

ccccgacgct 

gcgctcaaat 

atgcattgta 

gtgtcgacct 

aagcagcgcg 

ttgtaattgt 

tgaccgcccc 

caaccggatc 

ttt tccctt t 

gaaagaaaa t 

ctatccaagg 

taagaccgac 

ctgtattgta 

cagctgtatc 

aaggtaggta 

tctcccgcat 

ceggagatag 

tgactgtggc 

caagagccgc 

agcggcgcgc 

acatggctcc 

gcatcgagac 

acccacgtga 

tttcatggtc 

cagaaacacc 

catggcactc 

cttcaggacc 

gcggccgccg 

ggtaaccgta 

aacgcccttc 

gtagagatag 

cgcgt tgaag 

ttcagctgea 

tttgagcatg 

cgcaacgagc 

catgggtccg 

geca tgcaca 

gcagaggt tc 

egtaaatgee 

cttctctaaa 

cgtcagagat 

acagacttca 



ociccaac 



gtegggggae aaactegteg 
gacgoagccc caaagctcca 



ggaag^ 
catacaccta 
caaoctca z t 



acagegagtg 
agcactccrc 



l tCCct C cCcttccCCc 

teggctega 
tt tttctat 
gggatattgc tgcgtggatt gggataaacc 
atacattgaa cacatactca aa ca act cut 
tccc tgt cccc age a t a tea t a ga 
taactatege tatatatata 
ctcttctaca gaacttaaca 
gccatggctc etatgeggat taaccctgct 
tgcgtgtcac atcacgacag acegggtcat 
tgcacaggcc gectttggge tegatttget 
acattgaacc tgacataaac caaaggaatc 
ggccaccccc aaaggtcaag gaaccttttc 
aggegatgat cccccccagc ccaaagctgc 
agttgatatg tgggctggcg tccatgtgct 
gggcaggctg aaacatcaag tccacggtct 
aagcaccttc atataggacc 
tccatcggcc eggaagtaga 
ccgagttgcg gtgcgtttgg tcggcccggc 
taaaggtcat gtcggtgatg cccagcggcg 
ggtcgaggee ggtggcccgc tcgacgagct 
tccactccgc cccagggtcg ttgacggccg 
tttctgeega ctggaggtgg ccctgggcca 
cgtacgacag tcccgatgta tgegtcagea 
ctcgcaatct tgcgtttccc gcgtcgtcaa 
ccggaagcag cctatccacc gtctcatcca 
gggecatgat cgtggtcagc agcttggtcg 
gtageggegg cagctgattg cactcgtccc 
tataatttag attgectagg tatgtcccgc 
cgcaaagggg tccgattggg gtctctcacc 
ggggatctgc ctggatttca cggccttgcg 
cgctgccgca gcagcatcaa tgatggatcc 
ttactaataa ataaatagat gaaaataggg 
egggegctta ttttgtctgc ggctggggcc 
gegatcateg gategggecg gcggatcgtt 
teggtgaett tcggtgttat gcgccgctag 
aaagctacct aacctcactt aggtaggtag 
ttacgtacag gtccgtgata ctactacctt 
cttttctacg taggtaggta tttttatatt 
gttctaagat taccacatat accggtgttc 
cgtcctttga agctgccccg agaatcaatc 
acccgcactg cgattgtgtt gcatatgggc 
gcaggtactt aactcgccag ccagcacggc 
attggegeca ggctccgtgg aatgtgcgcc 
ctgggcctca agaatggtgc ctccgccaat 
agcgatgtgc aggtttccat egatgetaga 
atatatataa cegggaggag aaaatcaaaa 
gtaatgeaac tactgctctc cacattctgc 
gectggaega ggttggaggc atgggcgttg 
ccgaccaggt tcttggccgt gttgagctcg 
tgtcggacag tctccgcggg gatcgtggct 
atccagttaa tggcggcgga tttcttgtcg 
tgcatgtcgg gaaatccacc ctcggcggcc 
gaaatcatat tcatgcccat cgcgtcgccc 
atgcccgcct gggccaccgt cagggtttgg 
gcggccgcca gaacctcgtg cccgagagga 
cgttgggccg aegggaateg cagacaggga 
gtagtggcac cgccgccagc attgatcget 
acgccctcgg ttgtggccat gggaatgaac 
gccactccca ggggcagagg caggtacccg 
agetegtagt tatagtcgeg ataagggagc 
tgagtggagg gcgttttcga cacagcggca 
tccagccggg tcaccgagcg ggatgatccc 
ctgtaccccg cgatcttacc ccggagacac 
tctgcttggt tcgatttgag gagggectet 
ggggcagtgg cagagcatgg caccgacaag 



aggcaeggag 

ccccccc" a 5 

caaccaatcc 

acgagt gata 

tatgaactga 

1 1 aagagaaa 

gatatatata 

a t cgaaa t ac 

ggt actgege 

t ccagggt tc 

aa t tacaat a 

taggagcega 

ggcgccag 1 1 

gacgaaggac 

ggttca tct g 

gtggctgcag 

cagggcccga 

ccgagtcg tc 

gegcaagcat 

cacagatatt 

t acccgccca 

gcggcgccag 

tgtattcccg 

ggtgccgcag 

acccctccag 

agtccacgag 

cactggegag 

gtcgcaccgt 

tgtca ttcgt 

actgeaa tct 

gaaggcggt t 

cat tctgaat 

aa taaaaata 

a tagat egga 

aca t caggee 

gaaeggtat t 

gtaggtacct 

acctacctaa 

cga tt tgcac 

taattgetec 

acagcgatcc 

at tgaccaag 

cgcggcgaca 

ccggacacct 

cgtgccgacc 

aacccaccat 

cgtagagaag 

gccggatcct 

aagccgccca 

accagcgca t 

teggegatga 

gaacagaaat 

atcgcctcca 

gtggt ggtgc 

agccgcgcaa 

gactcaaccc 

ccacgcgtca 

t tgcatccgc 

aacgcctgtc 

accacgt t ct 

aatgactccg 

eggegaa tec 

gccgcaatcc 

agtt ccacca 

gtttcttctg 

taggagecat 



crcgtcccca 

a ct cga t z gc 

gt aga cget a 

cacagggtgg 

t atacactac 

aaaaaaaaaa 

tata t eta ta 

aaa tagecaa 

atagatgeca 

cagt t ggaag 

gcacn tag ta 

get t acccac 

ctct ccgccc 

catgggca tc 

ctct tcgagt 

caggagcccg 

gaacaccccc 

gtagegcagg 

atccggccgt 

ctcctgcagg 

gtccaggttg 

gcgactctgg 

gagcaaegga 

cgtgatcttc 

caegggcat c 

accgcgctcc 

ccggcagggg 

ccgagccccg 

ctcggtgtgc 

egggecatga 

tccatcagaa 

tttttaattt 

aaaccaaaaa 

cct tacctac 

ggtccccgaa 

gtcttagact 

aggtactctc 

gttacatgac 

gaaatggatg 

gtactctgta 

cttcatttct 

tgacccgccg 

a tgegggeca 

agcaagtcca 

tcca t cgagg 

tagcaaacac 

ctacctactt 

gaccagtggc 

ggctgcccgc 

egaegt eggt 

cggacttgcc 

tgccagataa 

gggctt tt tc 

ggaaccgga t- 

accggctggt 

agegctggge 

tacca tcacc 

gaetggeget 

catcgatcac 

cacagcaggc 

ccaggccgct 

gcacggcacg 

gctcgagagt 

gctcgtcgtc 

gggggcgagt 

cagggaegge 



24360 

2 4 4 2 0 

2 4 4 8 G 

24540 

24600 

24660 

24720 

24780 

24840 

24900 

24 960 

2502C 

25080 

25140 

25200 

25260 

25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 
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ggactctctc ccttgggccc aaccatgcca gcgagcaccc z gaazcaqqz ggttgttgag 2832G 
gaccagccta ccgaccaatg cgcccetgac acatagtcaa cccagcgggc tttccaatcc 2838C 
atcgagcaca ccgtcaaatc tagaacgttt agtggczca: azazzgaacc cagctgattc 284 4 0 
caagacgrac tgcagtggcg gcagaacgct tacgcgtgtc ccaaacccct cgacgcgcgc 28500 
cgttaggtac auctcgttga ctccgttggc cgcttccuc aaaggactga cagcagtagg 28560 
gggcagacta ccattagcca tatattctcc catgacccga tagaagaagg gcgacaattg 28620 
gaggaagtga catiagaacga agccgcccac cctcaaracc crccaccagg tgatgtttgt 28680 
cgacttgacc :cgr.gcccaa aaatccccga aggatgctrg gcattaactt gaccaagccc 28740 
tcctgggcce cggacrcgcg cgatctctaa tttcacgcaa agaatcgtgg catagaaggt 28800 
aaaaagtaga acggcatcaa tcaggagtgt ccatgccccc aggaagcaaa aatggccaag 28860 
actatccttt ggccgaagga ccgcccctaa tgccaacgcg ccgatttcca ggaggtaaga 28920 
tcgcacaata taccacccct ctcgatcaac cgcgagtcgg atgatgttag gaatcaattg 28980 
gttttgccgg ctatcatcac tgctggcgcc attgggaact tgccgctgcc ccccgcccca 29040 
cagttcttcc gacacgcaga gaacagcacg ggttagttgg atcggcttct caaagcccac 29100 
tgtcagaacg aggtacggga ttccttcgaa aagaagaagc atgtcgacgg gcacgtcgca 29160 
tgtagtcgtg attccaagcc cgagtacaaa agcaaaggca ccagacagca ggaccgaggc 29220 
tgccaaccag aagcgtgagc cgaggtggcg catcacccga aagagagaga ccacagtcat 29280 
attcatggct aggtaactga gccctatgat caccaagtcg actgtctccg catggtgaac 29340 
acgatgaagg aatgacagcc atgagcttcc gagccagtgt cccagtgacc tgggacttcc 29400 
ttcttcgcga ggagacctca gtctccaact atcatcctct ttttccgagg gtataatttc 29460 
gacagcctgt agaaagccat ctagttggga gtagggaact cgaaacgtca acgcggaatg 29520 
atcgggtgag ggggatgaat cgggggtggg cgtggtctgt tccgcaccga ggaaccctga 29580 
gagcgtgttt gataggaagg ggatactggc gtcgacagac gcacccggta gatcgagggt 29640 
gacaagagcc cagtgaaagt cagactaatt gagagtagca cagggacggt ccggtcagca 29700 
cgcagaattt ctcggaccaa tggagcgcag gagatctatc atacctggcc atcctccggc 29760 
accttaggtc ggtcgtctat ctgccatttc catgcgctag cctctcccag gctaacagtg 29820 
cggcttcccc agagaaatga ctggacgttc aagggggttt ccttggccca gggggccaga 29880 
cctctgtgag tagcctgata tgtcccttcc agcacatgaa gatgggtcgt tgccgcggtc 29940 
agcgccgtga ctagcagagt gtgaatggga tgtcgacacg cgtgtcccac aatggcacgc 30000 
aatgctttgg tcactcgatg ctgcaccccg ccagggtccg gctttctaac caccggatcc 30060 
attgcgatga ctctcttggt gcgcgagaga ggggtgtgag tgcttgtgga tgatgtaaag 30120 
acaagtacaa gattcgcaga aaggtcaaag agactagaaa aaaataaaat aaaaaaatta 30180 
aaataaaatt aaaaaaaacc cctaaagaac aagaaaagaa gagagaaaga aagagagaga 30240 
gagagagaac caagcctcag ggaaggaagg aaataagcca aggcaactcg cttggtgcga 30300 
cccaatgcta ggaggtccat gagactccgg tacttcctgg taatattgag ccaaatatac 30360 
ttattctagg taggcggtag ggattttaat gcgtatgcct atgatacaat aatggtacat 30420 
gaaggcgact gcacaaacaa tcttacgccg tttggctcgc tggctagaag agtaagcact 30480 
aaacaaggcc agggttgggg aaatcttctc cactgtcaga cgtccttcgt accttagagg 30540 
tactacctta ccgaggtaag gtactttatc tgaacacgga attacatcct cttaccacga 30600 
attttcagat ctcccaattt tgtaggttat gaaatattga aatttattga cctgcccccg 30660 • 
tccattggct aggtattaca cacggctgta tctctatctc atagaatggt atctagtacg 30720 
ctcttttcga tattacaaag gtggcttttt caggaaattt ggaaatatcc agtgggaaga 30780 
gggtgtacgt actctagtag gtaaggtacc ttacctacct caccttctgg agtcctggta 30840 
agattgtttg tgtaatgaat cgatcagctt agtgcgctaa aattcggtca ccctgtagaa 30900 
acataacaat atttcctttc ttcagcccct acctaaatta ggtgttcaaa gccaaggagt 30960 
aatacatacc gtcagtattt ctaggttggc aggagggatt ttgatgctgt cacccttggc 31020 
gcatcatgca cgattcatga aacagactgc actatccgtc ttacgcgcct tagcgccgcg 31080 
gctgaacgag taagcactca gccaaggagc ccaccccctc ccggccccac cccctggcag 31140 
ggctgtcaaa gcggatcgag tctcattggg tttgcttatc aaaatcggtt ggtaatccag 31200 
gtagcagtag catctggatg taaactaata aaaaggcaga ctgtcgggac cttaaaaggc 31260 
caaagaggta cccatgcccg ggttcttcgg tcgccagaac tgcaatgtat caaatgtctg 31320 
atgctcatga cccaggacac agcacagtac aggggggcaa tggctgcaga tcaaggtata 31380 
ttcacgaact cggtcactct ctcgccagtg gagggttcac gcaccggtgg aacattaccc 31440 
cgccgtgcat tccgacgctc ttgtgatcgg tgtcatgcac aaaagatcaa atgtactgga 31500 
aataaggagg ttactggccg tgctccctgt cagcgttgcc agcaggctgg acttcgatgc 31560 
gtctacagtg agcgatgccc caagcgcaag ctacgccaat ccagggcagc ggatctcgtc 31620 
tctgctgacc cagatccctg cttgcacatg tcctcgcctc cagtgccctc acagagcttg 31680 
ccgctagacg tatccgagtc gcattcctca" aatacctccc ggcaatttct tgatccaccg 31740 
gacagctacg actggtcgtg gacctcgatt ggcactgacg aggctattga cactgactgc 31800 
tgggggctgt cccaatgtga tggaggcttc agccgtcagt tagagccaac gctgccggat 31860 
ctaccttcgc ccttcgagtc tacggttgaa aaagctccgt tgccaccggt atcgagcgac 31920 
attgctcgtg cggccagtgc gcaacgagag cttttcgatg acctgtcggc ggtgtcgcag 31980 
gaactggaag agatccttct ggccgtgacg gtagaatggc cgaagcagga aatctggacc 32040 
cgtgcgtcgc cgcattcccc aactgcttcc cgtgagagga tagcacagcg ccgacaaaac 32100 
gtatgggcaa actggctaac agacttgcat atgttctcac tagatcccat cggaatgttt 32160 
ttcaatgcgt cacgacggct tcttactgtc ctgcgccaac aagcgcaggc cgactgccat 32220 
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caaggcacac 
a t a t ugaarg 
a cccacaaca 
gacaccagca 

CtCCCtctte 

cgcactacct 

C 8 C l C C Q C. C 

gccacgacac 
CGCCl ttrct 
tcccgaggt c 
cgcaaacaca 
gcct t tggaa 
tctcagtggg 



tagacgaatg 
tgcgga tttc 
gcca t a tga c 
gcagcagcgg 
gtgagctcc ~ 
tacaccttgc 
cogGca"iuC 
gggcgaccaa 
tcatgttct t 
gaacca tcgc 
aaca t ggca t 
gt gtgtgcaa 
gtgggaagat 



z ctacgcacc 
gaccgccata 
cccactcgaa 
ccacagcagr 
ctcctatctt 
ggtacaattc 
aoc l tec a tic 
1 1 ccgea aga 
gag tea ccaa 
agcact gega 
get cagagac 
ccactgcg ta 
a taggaaatc 



aagaaccuct 
t eggagt tgc 
cgcacz cgaz 
ctiga caeca 
caccccctga 
ctgee t gaga 
aaca t caccc 
c gegaggage 
ggggct t tec 
cga tgcta tg 
ctcaacaa t a 
gcgccctgtc 
ggacatcgcc 



t tacggcagt 
tcctgtcgca 
cccacccccc 
taccct tct t 
cacacgccct 
atgacattac 
gggaaccagg 
agccgaccac 
aggaggcaaa 
aggatat ct t 
t tcct cca tg 
a t tegg tgee 
acatcaaccc 



acacigt t ac 
aattaggege 
gaccagagac 
iiagcgagaac 
a ttctegget 
tct gggagta 
cgagga t a ta 
tccagcggct 
gtctget ggt 
t tccct cgee 
aaccaa t cca 
ggacagagtc 
tiacacccac 



32280 
2234C 
324CC 
324 60 
3252C 
22580 
32640 
32700 
22760 
32820 
3288C 
32940 
23000 



<210> 19 
<211> 31328 
<212> DNA 

<213> Aspergillus terreus 
<400> 19 

ctaccacgaa acttcagatt ttcctatttc aeggecttta gaatactgac ateegttgat 60 
ctttccctat tttctgtgag gtagtaagtg tggcagtatc tccctcccat atttggcagg 12C 
ctcttttcaa tattatatat agaagtggcc ttttcttaga aattggaatt tectggegag 18C 
agggegtegt ctgagactct agtcacttcc aactattgea gctacatgga ggagtacctg 240 
ttgactaagt tgtactctgt gatttgtttg ataaacactc aataggtttc tagcacgtcc 200 
cagttgtagc tgagcactca actagctaac gggtgagaac attgaagegt caattctttt 360 
tacceggaat acttttaaca gctatgtatg gttatttggg aceggggaag gaattaaatc 420 
aegcaggget tcttcttcac tcccatgcga cctactcaca ttgtgttcaa aagtaeggae 480 
gtgtctggat caatggctcc gaeagaegge caagaaaacc aatgcatttg ccaagaatac 540 
aaatggctac gtactagtcg agtagggege ggtcctgett gccgcctcca aaggaaaacg 600 
ggcatttact agatttgttt gttgcaggta ttgtactcgg atagtccagc ccagggatca 660 
attgactttg agacaagcag cgggccccgg cggccaatcg gcttttaagg egggtagggg 720 
tcgggtggtc tactgtatgc ggttcctccc gatcaccaca ttaegtctta cattgeatae 780 
ctaggtaaat cttatggtac tgtaggtata cacaatattg gtgaaagtta ccggtaaaaa 840 
gaaaatggaa aaactagaaa ggcagaaatt ttaccggtaa gataagtgaa tttcgtttta 900 
ttegcaggge atgatggtat gataggtaca tttacgaaaa cttgaactca tgataatacg 960 
gtaccacaga cagcttaggc gcccctgcat cccctttgta ctctcgtact ccatatagaa 1020 
aagcttgtat gtttggctgg acctgctgcc aegagtegge ctggtccgtc ttactacttg 1080 
tatatcagag ccgctttgta aggttttggt tcgcctcggt tttgegatte catatgggaa 1140 
cttttagggt aeggaaatec atgtaaacaa taccagctga agtacctact aggtgccacc 1200 
taggegctaa ggtacctaag tacatcttca tcccaaaaac aaagataaag gagttctaaa 1260 
ttacgttaga ttccgcattg ctgcagcgac acaegtgatt ttaggcttcc catcaccagc 1320 
aagctcacct ccaggaggcc cctttccctt ggacctgact aaggcatcgt gttcgccggc 138C 
aaccttacca taaggtagaa tgacatccca ccacggtgaa acagagaagc cacagagcaa 1440 
cacggctcaa atgcagataa atcatgtcac tggectcagg etaggectgg ttgtggtttc 1500 
agtcactctg gtggcgtttc tgatgetett ggatatgtcc atcattgtca eggtcagcat 1560 
ggcaccagcc tggagattgc tccgagcctt ggagacaact gactcttcac attegcagge 1620 
gattcctcac attaccgccc agtttcattc ectgggegat gtcggatggt acggaagtgc 1680 
gtatcttcta tcaaggtgat cgattttcca acccatgccc tcttcctttt ctccagccgg 1740 
gtttctattg actccacgac acgctctagc tgtgccctcc aacccttgqc aggcaaacta 1800 
tacactctgt tgaccctgaa atacaccttc ctcgcttttc tcgggttgtt tgagattaga I860 
tcggttcttt gcggcactgc tegttegtea accatgttga ttgtagggcq agcaqcagcc 1920 
ggaatgggag ggtegggget caccaatggc gcaatcacca ttctgtcggc gacagctcca 1980 
aagcaacagc aaccgegtaa gtactgatag ccagacctat ctcaaccgtt gttatgetat 204C 
gctgacccgg atatttacac atagtcttga ttgggatcat gatgggccgt cagttcgcca 2100 
acccattggg atccccggaa atcatcaagc atagtttctg actccattcc cagtaageca 2160 
aatcgecatt gtatgtggac cgttgcttgg gggtgctttc acgcagcacq caagttggcg 2220 
gtggtgtatg tatccccatt ggatttatcg gttcagtgct tgetttctea aaggaccttg 2280 
gctacgactc cgccacgtca agatctttcg ctcacggtga ttctggtcca ggtttttaca 2340 
tcaaccttcc cattggggcg tttgecacat ttctccttct cgtcatccag atccccaaca 2400 
gattgecate caegteggat tcaaccacag aeggcacaaa ccccaagaga agaqgggctc 2460 
gggaegtett gacccaactg gatttccttg gattegtget cttcgccggt tttgegatea 2520 
tgatatctct tgctttggag tggggtgggt ctgattatgc gtggaatagt teegtgatea 2580 
tcggcttgtt ctgtgcggcg ggcgtgtcgc tggtgctgtt eggatgetgg gaacggcatg 2640 
tcggcggtgc agtggccatg attcccattt ccgtggccag tcgtcgccaa ccctqgtgct 2700 
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:at tttctcccac 



:tTitccgggg ccct 
tccacttcca ggccgtcaac aarcrrtctc ccacca t cac tggagtqtat 

a t ggcaa 1 1 g cgacggg t gc aaicegt tga 
tataacctat 
z ccg t gggcg 



gcattggtgg acagatcgtc 
ttccaccacc tttcttccc 



ggtaaaacag gct_a t c a c c i 
gccggactgg tatcgacctt 



^gccttacig acaaatcgag 
ncgcaagcc ggaLCCtigt 



ccagccggaa acctcgacrg cagcatgggt 



t tcctgggag ccg t gggc eg agga ^ gcgga acgcaaaccc taggtgacct 
catcggtttg cgccgcactc ttatgcaaat gctcattgac tcggttgtcc 
ctgtcgtcgc cattcaaaat gcgctgcctc cacaaacgag ccccatcggc 
ccatgttcgg ccagacattc ggtggctcgc tttttctcac cctgaccgaa 
gcaatggttt ggactctggt ctgcgccaat atgcgccaac cctcaatgca 
cagccgcagg ggccaccggc ttccgccaag tggtccccgc tcctctcatc 
tcttagcata cagtaaaggc gtggaccatg cattctacgt tgcggtcggt 
ctaccttcat cttcgcctgg ggtatgggcc ggcttgcctg gagaggctgg 
agaaaggacg gagegaatga atttaatcct ategtaggga aegecaaaga 
tttctatgga gatagecatg taccgtttgg caactcatac actctaccta 
accaaacgaa ccatttttat gacaggaaga gcaataatta atcacgaacc 
acagacccgt tcaactgtgg ccttcttttg gcgcttcatg ggtaaacact 
tgagattcct agcttgacct gagcagacct aaacctgeat tacctagtca 
cattcgaata cctgaca tgt - tctg t tacga atccacggca teggtatatt 
ggccttccag tatttgette tcttaaagaa agttccatgg ttccattgta 
gacggcttcc ttgtacacat tggccatcac aggtgatttg agcaagacag 
caacattgag atteggtate aagcactgat accacacaga actggcgggt 
ccgtcttggt agtgtggcag agaaategtg gcgtgtagca cctacgcttc 
tcgcgccctc agacccgcag accaactgga gtaceggact cggcgttttg 
teaacegggg ttattgtacc gccaacccgt gcagctacct agctcactgt 
taatgetagg etteggtctg acgatgtcat tetattgetg atgatgttcc 
ttatctacca tectggegag gtcgaatcag cccgcgtccg tcatactcca 
gtaaggaagg cacatccatc aaacatcaaa ctagaagaaa ttggccacga 
agatgcgccc ggtgcgcctg ctcccatagc tatggttggc atgggctgca 
aggegcaaca gatccccaga aactgtggaa attgetggag gaaggaggga 
taagattcct ccttcacgat teaatgtegg eggggtctae caccccaatg 
aggateggtg agtatgaagg attctgggtt gagcattttt gaggeccata 
cagaacgata ggcgttgact gcgagtagat gcacgttcgc ggtggacact 
agacccggct cttttcgatg cctcattttt caatatgagt actgaagttg 
tcccgcgatc gttgtccagt tgtgtatgga teagaagegg aataaaccca 
gecgaatagt gtatggaccc ccagtaccga ctcatacttg aagtcgttta 
gaagctggta tgtattatat tccttggttt cccacgtggg tattaactcc 
geagegggaa ttcctctcg.a. - acaggtctcc ggctccaaga ctggggtttt 
atgtatcacg actaccaagg ctccttccag cgccaaccag aagcccttcc 
ataacaggaa atgctggcac catgctcgcg aatcgegtet cccactttta 
gggcccagtg tctcgatcga cactgcctgt tccacaacct taacagcett 
attcagagct tgcgagctgg agaatctgat atggcgattg tcgctggcgc 
cttaatcctg aegtctttae taccatgtcc aaccttgggt gagtctggtg 
ctagtgatca gcattcttgt tgcacagaca atatgtgatg ttaactgtga 
ccagcttcct ttegtccgat gggatttcct actcatttga etcgagageg 
gtcgeggaga aggagtggct gega tegtet tgaagactct gcccgatgcg 
gagacccgat ccgcctcata gtgcgcgaaa eggcaatcaa ccaagacggc 
ccatcagcac gccgagcggc gaggeccagg agtgcctgat ecaagattge 
cccagttgga cccaaaacag acttegtacg ttgaggecca tgggacggga 
gagatccget ggagcttgea gtcatctegg ccgcgtttcc gggacagcag 
gctccgtgaa agecaatate gggcatacag aggctgtcag tggtctggcg 
aggtggctct ggctgttgaa aagggggtta tcccgcctaa tgcaaggttc 
gcaagaagtt gctcaaggac actcatatcc aggtagcatt atcttcacga 
tcattctatt ctttctattc cagctcctcg ctgatttaca aacagattcc 
caatcatgga taccaaccga tggtgtccgt cgcgcatcaa taaacaactt 
ggcgcaaatg etcatgeaat cgtggagcaa tatggcccgt ttgeagaaac 
ccacctaatg gttattctgg caactatgat ggcaatttag gaaeggatea 
tatgtgctga gtgccaagga tgagaacagt tgcatgagaa tggtttcaag 
tatgctaccc acgccagacc agecgacgat ttgcaattgc tegegaatat 
cttggttctc gtcgctcgaa cttccgatgg aaggcagtat gtaeggcaca 
ggtcttgccc agaatttggc gggagaaggc atgcggccaa gcaagtcagc 
agactgggat gggtgttcac aggecaggga gcgcaatggt ttgcaatggg 
attqagatgt atcctgtctt taaagaggee ctgctggaat gcgatggata 
atggggtcaa cctggtccat tataggtaaa gacccgcaac aagtccccgg 
ggaaagcact cactcatgtc accattgeag aggaactcaq tcgccctgaa 



t acctgcct a 
a tcct geegg 
g z ccccacca 
get ggtagt c 
gt cca ta tec 
catcra tcag 
gcaicgtttc 
ctcc l t tagc 
a tt uegctag 
ttgettttta 
cacgaggtaa 
tec egggtec 
gec t c tggag 
cgea tgcagg 
aata t taata 
ctct tctcaa 
agt tgtgacg 
tcagtgttga 
tggaaatatg 
caatggacat 
ategt taact 
tcgaggagat 
a tgaagcagc 
agagcaacta 
gtcggcgt ta 
ggagaa tgga 
t ggcacccct 
ggtgtgcgca 
tgacaccat t 
gat ttggcgg 
gcgcctggtc 
gecagegggt 
tcttcctgtt 
ttctcgacga 
ccagtgtacg 
tgctaagact 
tgaggeget c 
ccatggctcc 
t gcaggaacc 
aeggtatt t c 
tgaccttcgt 
gcatcttgcc 
gaacctgeta 
t tcaa tcca t 
tgtgctgcga 
gatggctatg 
gtgcgagacg 
cggaccccag 
ta tcagaagg 
accagaqcag 
a tacaggtgg 
agt t tgataa 
ctccagccga 
tttt ttcctc 
actgtgtagc 
cggtttcgga 
atcga tetge 
agegcatata 
gctgtgcgac 
agcatacacg 
cagcctcacg 
cgaccaagta 
tcgtgagt tg 
tat caaggaa 
cccaggc ta t 
aeggaaagtc 



276C 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4 4 40 

4500 

4560 

4 620 

4 680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 
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gtctactctc atcgtggaac 
cagctgcacc gtaccctaec 
tacacagtac actaace; 



cgccattgga 
cggcgtcggt acataagggg 



tgtccgccat tgccaagt; 
taaaaatcac ccaaacct: 



gcgttgatca gccagaartc agtctgcca t tatcracgcc tc::caaat: 

itccaaccag cagccgtcac cac:cactcc 
izc ggggcactaa caccccccic 
:ca agagaccgcc 

ctgt cgga t t gagccccac: gaagtgggca tatacarcac acaccntcca 
aagaatgcct ggtggtggcg egtgtcaaca gcccgtcgag cgtgacggtc 

gaggaactgc tccatgcuga tcgtatattt 
cactccagcc acatgaactc gatgacagat 
ccggtcttac agaactcttc ggagcagacc ccagtgatgc agcaaacgcc 
tgatctacgc ttctcccaga accggggccc gcctgcacga catcaatcg: 
ctatacactg ggtcgaatgc atgcttcacc cggttgagtt cgaatcagca 
tgtgcctgga cgaaaacgac cacatgccaa aggtcgatag ggtcattgag 
acggagcgct tggaggcccg atcaagcaga tcatgcagc: tccagagcit 
acatccctta tctgtcctgt ctttctcgtg ggaagagctc tctgagcacc 
tcgcatcaga acttatccgg gccggatttc ctgttgactt gaatgcgatc 
gcggatgtga agcagctcgg gtccaagtgt tgtctgatct accgccctac 
acgagaccag atactggaaa gagccgcgca tcagccaatc tgcccggcag 
cagtccacga cctgatcgga ttgcaggagc cgttgaacct gccgttggcg 
acaatgtgct tcgtgtgtca gatttgccat ggctacgcga ccacgtcgtc 
ttgttttccc tggggctggg ttcgtgtgta tggcagtgat gggaatcagc 
cgtccgacca tgaatctgac gacatcagtt acatcctacg cgacgtgaac 
ccctgattct acctgcggac ggggaagaag gaatagatct gcgcctcacg 
ccgatcagag tctgggttca caggactggc aaagattctt agttcattcg 
acaagaatga ctggacggaa cactgtacgg gacttgttcg agcagagatg 
cctccagttt gtcgaaccaa caacggatag acccacggcc atggagccgt 
cgcaggagct gtgggactca ctacatcggg tgggaattcg tcatgggccc 
acattacgtg catcgaaagc gacgggcgag ggtcatggtg tacatttgcc 
cggcctccgc aatgccacac gcctacgaat cccagcacat tgttcaccca 
actctgcagt tcaggcagcc tataccactc ttccattcgc tgggagccgg 
cgatggtccc cgctcgcgtc ggctgcatga agatttcctc ccgacttgca 
ccagggacat gctgcgcgca caagcgaaga tgcacagcca aagtccttcc 
ccgatgtagc agtttttgat gaggcagatc cggttggagg gcctgttatg 
ggctggtctt tcagtctctg ggggcaagtc tgggcacttc tgaccgggac 
ccgggaatac ttgcagctcc tggcattggg ctccagacat cagcttagtt 
ggcttgaaaa aaccctgggc acaggtattc aggagcacga gatcagcctc 
ttcgacggtg ttcggtgcac ttcattcaag aggccatgga aagtttgagc 
tcgagaggct gagtggtcat ctggccaaat tctatgcgtg gatgcagaaa 
gtgcccaaaa tggcgagctg gggccagaga gctccagctg gactcgggat 
caagatgcag cctccgctct agagtggttg ctggtagcac caacggcgaa 
gcctgggctc cgtgctcccc gctatcctac gtcgggaagt tgatccgttg 
tggatggcca cctgttgtcc cgctacta tg' tcga tgccct caagtggagt 
cgcaagccag cgagctcgtg cgcctctgct gccacaaaaa cccgcgcgct 
aaatcggcgg aggcaccggg ggttgcaccc agctggtcgt ggactccttg 
cgccggtagg ccgctatgac tttactgacg tctcggccgg gttttttgaa 
agcggttcgc gggatggcag aatgtgatgg attttcggaa gttggacatc 
cagaagcgca ggggtttgtg tgcggatcct acgacgtggt gttggcttgt 
atgccacttc taacatgcag cgcacattga ctaatgtgcg caagctgttg 
gcaaactcat tcttgtcgaa accaccagag acgagcttga cttgtttttc 
ttctgcccgg ctggtggctc agcgaagaac cagaaagaca gtcgactccg 
ctacgatgtg gcgcagcatg ctgcacacta ctggattcaa tggtgtggaa 
gtgactgcga tagccacgag ttctatatga ttagcaccat gatgtccacg 
cgactccgat gtcatgctcg gtcaaattgc ctgaagtgct cttggtctat 
ctacgcccat gtcttggata tcagatttgc agggagagat tcgcggcagg 
tcacttcgct acaggcactt cgtcaagttc ctcctaccga gggccaaata 
ttggagaggt ggaacactcc atgcttggtt cagtcaccaa cgacgacttc 
cctcaatgct acagctggct gggggaactt tatgggtcac ccaaggagcg 
ctgatgatcc cctgaaggct ctacacctcg gattactacg taccatgcgt 
atggcaagcg atttgtctca cttgacctcg acccttcgcg taatccatgg 
cgcgcgatgc cattgtcagt gttctggatt taattagcat gtcagatgaa 
actatgcaga gcgggatgga gttatccatg ttcctcgggc atttagtgac 
gaggcgagga agacgggtat gccttggagc cattccagga cagccagcat 
tagatataca gactcctggg ctcctcgatt ccctgcactt cacaaagcgc 
catatgaacc agataaatta ccggacgact gggtagagat tgaaccgagg 
ttaacttccg tgacatcatg gtcgcgatgg gtcaattgga atcaaacgtc 
aatgcgccgg cgtggttaca agtctcagcg agacagcaag aacaattgca 
cggtcggaga tcgggtttgc gccctcatga acggacactg ggcgtcgagg 



gcgcttgtcc 
agcggagagg 
ataagctaia 
ggcatgttgg 
t tacagagtg 
t cgggagatt 
gcgagacggc 
get 1 1 ccgag 
agtaaaga t g 
etteggga tc 
1 1 ccgt cgaa 
a t tggacctc 
gccacgtgtg 
c ttcgect zc 
aaccctcccc 
cct tggaacc 
eggaagggee 
eggtea tggc 
ggctegcata 
acgctctgct 
tt tgegcagg 
at t tgtgctc 
a teactgetg 
gaccagcctc 
aaaaeggege 
t t t tt tcgaa 
ategeggaca 
accacactag 
a t caaatct g 
gat ttggagg 
gca ttggtaa 
gagctcgaag 
tccaccgacc 
aaccccggct 
atat tggagc 
gtaggega t g 
caactggcgt 
agegagcagg 
atgatctgtc 
gaggt gatga 
cggtccaacg 
cgcat actgg 
ggcccaaatc 
gcagcccgca 
gaggacgatc 
caggtcctgc 
aagecaggag 
act t t eggge 
ccactaagcc 
gt t gaggctc 
geegtacagg 
gttgactcat 
aa t tgttccg 
tgegtattec 
acacttttga 
acaa tgaagt 
aatgaaagee 
acaggegatt 
aaggagtttg 
tccatcaatg 
ctcctgcgac 
aatgtggaca 
gcgtt tggtc 
atgggct teg 
cccgggct tg 
gtgaccacaa 
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gcccgaccaa 
ctctggcct z 
aaacggtc: t 
aat taaccgc 
t cga t aaa t ^ 
acggta tcaa 
ggcctctcct 
gcaagaagga 
ccttctcc:c 
cga tgtccga 
cagagtat cc 
ttgggaaga t 
t gccacta 1 t 
gacggcggat 
gaactgctcg 
t acaggcgtg 
cggaggaaa t 
tcgtctcgca 
g ttggaa t ct 
tgggtgtca t 
cgct cgcaga 
tccagtcaa t 
tcggcta tea. 
ctgtgtgt t c 
gcccaggccc 
agtatcgtga 
acaatct tea 
tggaggega t 
ccactcagac 
ggatcacagc 
tegecaaagt 
atatctctcc 
t tacagcatc 
t tgtgtgaca 
ggatgagacg 
gctcgtccgt 
tagctgggtc 
gtggtgctgc 
gt tgaagaa t 
t ttaegctet 
gttagtgtag 
gtct tgaggg 
cacggtgcaa 
aaaacagaag 
cccacgctac 
aaga tgetae 
ctatat tggt 
atcccct t tt 
tcca tcca tg 
ggaggagaca 
caccccaact 
cgaagaagag 
ccgccacatc 
acgagctctc 
tacttctgac 
etactegtat 
ccaccggatg 
cgcaactcta 
tagacacat a 
tcgaattatg 
aagtggaaag 
gccagcatca 
atccattggt 
accaageggg 
attggegaca 
tatcgttgtc 



eg t get gcoc 

CaCccCcCCI 



gatccacccc 
tgctgaagt c 
ccacct cga c 
gacccgcacc 
t cagaagagc 
tct tgagcag 
cat tga tat t 
ggt ca tcttg 
ta tgteggee 
t gt tgtgaca 
tetgaagee t 
ctgcgagtgg 
cgtggacccg 
tgatgtggcc 
gcctccgat t 
aa tgacggcg 
gcaccgaat t 
gggaggegea 
gcaccgca tg 
tgggtatg ta 
accct tgcac 
ccctgccgca 
tcactgggca 
teegt tgagg 
cgccaggc tg 
gagctgeaag 
a t tggcgggg 
taagt tcaat 
cgcggaagtg 
eta tagctac 
ccgt tctgag 
atgaataaa t 
aaggeca tgt 
agaceggaaa 
cgcagggtt a 
aacgtctga t 
gt tgeatage 
aggatctget 
gttaa ttagg 
tagctgggaa 
cccaccatat 
agaaaegcag 
ctgcagcgcc 
cgccgagtgc 
taggt gtct t 
cctgatatga 
gat acacccg 
tegcttaaga 
gccctctccg 
aaactcctga 
gtcacctcgc 
caacgcgccg 
teggaggat t 
gcggcaattt 
ggegtatate 
ctgctaataa 
ccagaacccc 
tcctcctcca 
teggagaagt 
ttgetcagea 
at t tctctcg 
ctgatattca 
tegtaatcca 
ggaataaaac 



a Liccagaga 

l3 Cc 1 1 t Cc C 

ggegcaggae 
1 1 tacaactg 
cctgat catg 
cgtggcaagg 
t t tgactgtc 
aa tagecgae 
ctctactggc 
ct gtgggagc 
ctggagaagg 
gtagcccccg 
aacgtgtcg t 
ctggtcga tc 
gtcgtgacga 
ga tgaaagee 
eggggegtea 
gaegggt tec 
geateggacg 
ggacaageca 
gctcacaacc 
gcagagacag 
gaagaggagg 
cccacacggc 
cacgccgact 
gacaatcatg 
aaccgtgcaa 
ctcatctcaa 
atcggcgtgg 
gttgatatct 
gtgetgeaga 
atatatat t g 
atgeagat 1 1 
cagageca ta 
a teagegcag 
atggttttat 
agaggecact 
tgaggatctg 
cctt tgagat 
gcagattacg 
ataggtctat 
tccatgtcgt 
cgcccgcacc 
ataaagaaca 
tcccaattt t 
geagceggaa 
aagtgatgt t 
tgtatattta 
tct tcactct 
attcctgegt 
cctacgcccc 
cccacaaaca 
caccgctcat 
agccgggttt 
cgactccata 
gtccggcgtt 
ccatgaat tg 
gt tagcagtg 
aaggcactct 
taataagacc 
tgtcaatgee 
aatcctgggt 
ggttgaagta 
agttatatga 
tgtaggt tgg 
gcctctccct 



ctcttagtt z 

1 1 zacaccqz 

gear aggeca 

ctggcagtga 

tct t ct cgag 

gggt ggacgt 

tggctaggt t 

tcgaca tgtc 

ageaagegaa 

gaaccgcaa t 

cct t tegcac 

a tgacaeggt 

atectgttgc 

gcggggcgcg 

gtct ccaaga 

agct tgaagc 

tccaaggggc 

atgccgccct 

tggatttctt 

actacgcggc 

agecageggt 

attctgetgt 

ttctggacgt 

ctgctgtcat 

ggatgeaaga 

gagctt tgtc 

tcagccaaca 

tgttcggcct 

actccctggt 

cagt tttcga 

gatacaaagc 

acatgcctcg 

gtt tettgee 

geca tgcaag 

caca tct t gc 

tatcttaggt 

gceacaatcc 

tgtaacatct 

gtatgtagag 

ggttgaaacc 

gaagctactt 

cgggccatac 

cgccacgctg 

aeggegt tgg 

cacatgggtc 

gggatttggg 

tatcgt eggt 

tatgtaaggt 

gtacct tatg 

cgccatacct 

ctcgcgccaa 

caccttcgaa 

cacatcccca 

t tctctccac 



taccgacacc 
t teggeggga 
ggtaegtatt 
ctatt ttcga 
c taaaeggge 
aaattcgett 
aagtgaaaag 
tgcatcctca 
ta tgtcccag 
agagceggtg 
ctcgttccat 
taaat tctgg 



cccgca t gcz 
tgcccgca 1 1 
ggeggecat t 
gaecaagegt 
cagggactcc 
ggtt ttgaac 
tggt cggttt 
gaegt teg tc 
gcccgctgaa 
eggectgat t 
ta tgeaqage 
ectegt ccgt 
tgggggccng 
atatctcatc 
gcggggctgc 
ggctct ccaa 
aatggttctc 
gcggcccaag 
cgtgatgctc 
tgccggagcg 
caccatcgac 
ggeggaaega 
cctcgagcaa 
cgtcaccggc 
ggctcget 1 1 
gctgaccccg 
ggagtcaa tc 
gaeggatage 
cgeca ttgag 
gttgatggag 
t tagata tat 
a tagtgtctg 
tagcgtgaat 
cgtaatccta 
tgtctct ttt 
t tcccta tea 
aggt taatag 
acggatggta 
t tgtttgggg 
ccccagacgt 
actacttact 
tt tgggaaac 
cagecagagg 
at tectgeaa 
eggeggaacg 
acaeggagge 
tegcacagag 
gccatctcca 
get tcagcag 
cccgtcaa ta 
ccctcggagc 
atcttatcaa 
catctggccg 
gacaagggag 
gacacagacg 
ga tgcagagt 
accgacatat 
ttgat tgtgg 
ctatctagta 
tggctagtca 
teggggaaac 
gaaggagtcg 
acctggccgt 
ctagt tgtgc 
gagggcgttg 
aggaggaggg 



gee tcca tec 

ctgccaggtg 

at tettgetc 

aacct tttga 

agettegteg 

t cgctagctg 

gtagaaateg 

cgcaa tgtct 

atct tccagg 

ca t ccaata t 

ggccagcacg 

cagga a cgaa 

gg tggta teg 

at tctgtctc 

accgt ttctg 

cagtgtcggg 

aaggacgccc 

gttcagggaa 

tcatccttgg 

1 1 tcaggacg 

cteggaatgg 

ctccaacgga 

gcta t atctc 

a tcaacaccc 

geggggatea 

gcggaagatg 

gccgtga tea 

gaaatgtccg 

ctccggaact 

ggecgaaega 

atgta tatgc 

attttctctc 

aegtcacttg 

tagagtcctt 

at t cat taat 

ataaga tcct 

tctcaacatc 

aggaaaat tg 

cagg ttcggg 

gctgggttta 

aaggtttgta 

actttagaga 

gaatagaaag 

caagecttgt 

aacgtgagac 

gtgtctaacc 

t tacacgaa t 

ca tacatcca 

acaagaa tec 

t caccaccgc 

ccaccctcat 

ga tgggtcta 

atgtcgccgt 

atactacata 

aggagttact 

ateaaegtea 

atcct tgcaa 

gttgeaaage 

gtctctcata 

tgtcaatgee 

caaggcat tg 
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ctccgggtac 

tagtggacgc 

gact tcgtga 

caa tatcget 
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acagcgactg gccacttggc ttaatttgta cgaganagat gcagatgacc Cettccatgc 14640 
cgaacggttc gtcnatatgc cgaagaaagz gagtatcccc atgtcctgtg aatggatcgg 14700 
ctggtct tec tccacaatgt cgacaraga t icccatgaat gaga rcaaag aatacttcag 14760 
gacaatgcta tgaagtacat ccccaggtta ggatgctgtc aagagggtcg cacggaagac 14820 
ttacctgtcg aaactgctga aaccagattt acacccatcg cacaattgct tgaacagcat 14860 
cagacaatcc cgagcgtgtt gtaggcagac agcgcccctt tcctggctcc aaatgcgttt 14940 
atgcagaagt acccaacagc tcgagagtgc aaacctcaac tgcgttgcac agatatactc 150C0 
ttcgagaggc ctattcggat ctccggtacg cagtacangc tcatgttgca cgtgccactg 1506,0 
gctaagtttt cgagcaattc tgcggatacg gcggcggacg cgactctgtt cca t n - t get 15120 
ggattttgct gagtataagc ccagatatat ctccccaaga atgaatgcca agcg-atgcg 15180 
tgccgcaaaa gcatttggta gtggcgcagc ggagtcatat ggaggcaatg gcaccctgca 15240 
gtcatacgag ggcaataggc aggccttgcc cccaacgatg gatacgtgct tctaaatcct 15300 
atagtaagcg aaacgctcag aaaacgaagc aagt tt ttga cgaacatcga cgaggaagat 15360 
gctccagaac aagtctttag gctggtcgcc cacagcccca tcttccgttg gcggggtcgt 15420 
tgtcgttaaa tggagtccaa tgagcccgga caactcgcag acttgagcaa agatagtcag 15480 
aaaaattgcg aaatcaaaat actccattgc cacaagtgcc ttagtataat aaagtcaggc 15540 
cacgttccaa aggtagggtt gtaagacgca ccaaacaaaa gagggcccgt atattcgata 15600 
gccggggttt aacaagagtt tccaatcgat tatagcactg cctgatatta cgcagcaggc 15660 
ttatgatcat ataatctatt ggtatgtctt gt-cctgtgca tcccacgact ttgttcgcaa 15720 
ttgatgtttg agtcagggcc tgaagcacaa tgcagttgaa agaggtaatc caaaccaggt 15780 
ccggaggccc agttctgatt gtgtactgag attccactgc gtccataata gtctgtcgac 15840 
taaatatagg gagtcgtgga tgcagcttct tgaaatattc gttgatagac ggctccacta 15900 
gagcgcgggg tggaagagac gggagaaagg tatcgttttc ggatgccgat agtg — ggga 15960 
tatcctcaat gatctgttga agcgtttcat tcgcctcagc caccttcgag cgcacagctg 16020 
acacagcttg gcggtcaaat gcgaccatct ctgcgacttg gccgctggtc atatcgaccg 16080 
gtaaccaaga ggttaggttg gcgttagcag cctgtgcctc catgacgagt gaaacaaaag 16140 
agccataacc gtggagaggc cagaatccat cttcctcggt gggtgaagtg tgtttgcctg 16200 
agtacaagtt tacttcacac tcttctgagc tggttgagct tgtgtacccg ctttcrtgtg 16260 
actgtggcct tgatagatca cgaaagccct gatgggcatc aataaaggac aggcatcgtt 16320 
ggagggcatg ttcgatatct ccaacccgac tcccgagctt ctcgacgtcg cgcgccctga 16380 
ctgagagtca gaaagacagg tatttgaccc aaatatagac catcttatca cttacaattt 16440 
tgctgtttgg ttgacccgtg taactttgtt ggagaattca cagggtaggt tatatcggag 16500 
gcaatgggag cactttggac cctccttgct gcatcggacc ttacggatcc ggcactcatc 16560 
acatgcagga aatgcctgcc gcttcgacga cagctctgca ctctccatca ttcaactcta 16620 
tgtgagatag aacatgagtg aacgatgagg gaaggccctg gtactcagat aaqtcettga 16680 
tgtctctgta gatctcaact gtgcaagacg aagactggtt tgaagtaagg cccacagctg 16740 
aagtggcatg ataagttcac ctgttggttt aatatacctt attgttgatg aaagcctgac 16800 
cttggccaaa gaagggagca gctgagtatc cgacatattt gataaggacc agcggcaagc 16860 
ttgggatctg ctgcatactc ggattacgcg aaaaggtagt cggagtacgt gaaaaggtac 16920 
tcggagtacg cgaaaaggtt agattaagcg ctacggacgc cggcagtccc tgaatgaaac 16980 
tctgcggggc ccacttgccg ctgacggtgg cacgatgaac cttctggccc ctactctaat 17040 
atagcacaag gtgctcgaat aacaaagctt gtcgccatcg tggcgttgac gccagcatgg 17100 
ctctgcattt gctgtactat acggtaagct ■ tagtagggcc aaatttatac tatagtatgt 17160 
tatttccgat ttacgtacgg aatgggaaag aacttgtatg cggccgcagg ctgaaaaggg 17220 
ccgaattaaa gaactgccat aactatgtgc atcccgatga tgcctcaagc gcagcattca 17280 
acaggggcaa aatatcgtca caggtcagat ttattactac atcgggaagt cttcggagat 17340 
tgattccgag gcatagagtg ccttggacta gcgacgccaa gtttctacta tcaggaaaag 174C0 
tgctagttcc agattggaac tagccaataa gtagccagcg gtaagctgaa caaccaattt 17460 
ttaaatatta gatgaacaaa tgcgtagttc agagacacca tgcacgttgt ccaatactat 1752C 
gaggctttag caagacaaag tggagcacac gccacctggt aaaagtgaac accgcctgct 17580 
gcaaaagctc gtggagtttt attttgatta ctagaaagca gacatcccct ccttgtcccg 17640 
gaatattgca aatcagggaa cattgaataa gctgggccgc taatgcgtgc taagaacagg 17700 
gagtccacca gagtgaggta gatcccttcg tatttcagct gtggcaggac agcgatctgc 17760 
ggagctccga cggctagccg tatggcattg acagtgaagg agatcggaac ctctactcca 17820 
gagtacattc cgaaaagcca taattattcg gatgtacgga atacggaggg atgataagcg 17880 
ctagaaatag ggtggatcct actaaaattc ggatccttta gcttgtgcta acgtgtcaag 17940 
ctggggcatc ctgtagtccg tgctactcct ataggtttac cggctatata aatctggtgg 18000 
agccagcggc ccctgcacct ctttaggaca cgcggcccct tcaaaaaagt caaaatagca 18060 
gtcaagagca gtgaaagaga cccatgctgt ctcatagaaa agcaaggaac ctcgaaatcc 18120 
acatgagatc gatccacatg gttctacttc aagtgggtca ttaattcccg tcacctgaga 18180 
catgagatac ttagtagcta agatatggta catgctgttg atcaaacgat atgagtgtca 18240 
tgttggggtg aatactaagg aacatttcta ccaacaatct cgcggtgaat agaaaactga 18300 
cctttaaatt ccagcacggc gaacgaacag catccatatt tcagtttggc tgcgccaaga 18360 
acttgtagac tttctcgtac cttgacgctc tgtcagtgta caagaatatc atgcctgtag 18420 
attactccta aacttacacg ggaaacatga tggccgaggt aagggaaagt ctccctgtcc 18480 
gaaaccagac gccggaccag aaaaccccaa tgccttcatt gcgcagaagt gtttccacgc 18540 
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agttaaaggt atttccgtac agttgtcttg cctgaagaga ttgcattctg gtaacccgaa 18600 
gtcgcatcca agtcagcata racctatctg ggcgggttac tgtcgatctg agcagcactt 18660 
accgcgtctt gaccacgtcc agtggctgtg tcgaccaggc gcaacaaact ccagtaacag 18720 
aaccgaccaa ggtgcttgcc agagggtgca cgtct tcgcc gttcttcgag tattttcggg 18780 
ccagcccaat aagttcgtta taaacagtga acttcactgc cgcattggac gactgccgca 18840 
aaattgtagg accaaccqca gagaagaatc caagcggtcc ccgatctcga aggattccag 18900 
ctatcgcgcc gaaagtcgta cctaactctg catttccaac cttcct tgca ccaatgctaa 18960 
gagaacgaat taagaactga tcggataacg ggcgagatgt tgcaacttac atttccgtct 19020 
tgatcgcctc cgctggggtt acggctaaga cagcctcggt cacgccagcc ccaaacccag 19080 
ccaggacgga agctccagtt gagagctctc catttgggcc cgagagggcc gagcgataaa 19140 
tattgaagga ggcaaattct gtttgttatt gcaaacggtc attccttttc gctccacgcg 19200 
ggcttataag ggggatcgct acatacgaac ggaggctttc aatgtggttc ctaccaaggt 19260 
ggctccatac ccagcatacc agcctcggat tccaggtttt atagctgcca catcatggtt 19320 
tctccgctta agctgggcgc gagttttagc cgctaccagc ggaagcaata gactgtcagt 19380 
tggcggccac ggtaaagcgg aactcttaaa cacaggataa gctcacattc gaaagggtag 19440 
gtgatggaga tttcaactgc cccagcacaa gcacccgcga ccaatgcagg aatgcctttc 19500 
gtctatacat atattaagag aagacattgt cagtaacatg gcacacgcgc gaccaagaag 19560 
gaacacatac acgcttccca cgggcttttt gggtaagggg tgcctttggt aatggaacat 19620 
ttgtctggac ,t-t tagactcc atgacgagga tgtgctgagt ccaaacaaag cttttcttca 19680 
cagagatagg gctgcagacg ttatttccag tttaccttcc ctgtgttcag tatcaggtct 19740 
tatattgtat tatctcaatg cttatgactc taagtggaat acattggata tcagtttgtc 19800 
acggagtcgg cacccgatgg ctatcgcaat cgtccctggt gggtcttgaa atcgtatgtc 19860 
acacttattc cggatgaaac aca,t tccgga gcgcgcgttg atattgctaa acagtataga 19920 
cccaaatggt ctgcagaagg ccctaaatag taggtctcat tagccagtat ttagctgtga 19980 
ttgcagatca ttgtcagcct aacatcagtg taggttacgg tgtgatattt acttgcatag 20040 
aaggttccag accacacggt tctagatcct ttgacagcag catgaatgga ttcccctcta 20100 
ggtgccgggc gccgacgtgt gcgttgctcc gaaatttgta ggacggagct cggataccta 20160 
gccgctatgg gcatcggagg ttgtagcagc gtacacgctt ggatagttaa ataatcggat 20220 
gtacacccac tgttggaaat gacgggggcc taaaacacga gattatctga tccaatttct 20280 
gttcgttggc attctatcat tcgcagcgaa gatcgtcctc ttaaattgac catgaccaag 20340 
caatctgcgg acagcaacgc aaagtcagga gttacggccg aaatatgcca ttgggcatcc 20400 
aacctggcca ctgacgacat ccctccggac gtattagaaa gagcaaaata ccttattctc 20460 
gatggtattg catgtgcctg ggttggtgca agagtgcctt ggtcagagaa gtatgtgcag 20520 
gcaacaatga gctttgagcc gccaggggcc tgcagggtga ttggatatgg acaagttagt 20580 
tctatccaat ctgaacagtc tacaaagtat actgacgatc ctttgtatag aaactggggc 20640 
ctgttgcagc agccatgacc aattctgctt tcatacaggc tacggagctt gacgactacc 20700 
acagcgaagc ccccctacac tctgcaagca ttgtcctccc tgcggtcttt gcagcaagtg 20760 
aggtcttagc cgagcagggc aaaacaattt ctggtatagc tgtcattcta gccgccattg 20820 
tggggtttga atctggcccg cggatcggca aagcaatcta cggatcggac ctcttgaaca 20880 
acggctggca ttgtggagcc gtgtatggtg ctccagccgg tgcgctggcc acaggaaagc 20940 
tccttggtct gactccagac tccatggaag atgctctcgg aatcgcgtgc acgcaagcct 21000 
gtggcttaat gtcggcgcaa tacggaggca tggtcaagcg cgtgcaacat ggattcgcag 21060 
cgcgtaatgg tcttcttggg ggactgttgg cccatggtgg gtacgaggcc atgaagggtg- 21120 
tcctggagag atcttacggc ggtttcctca aaatgttcac caagggcaat ggcagagagc 21180 
ctccctacaa agaggaggaa gtggtggccg gtctcggttc attctggcat acctttacta 21240 
ttcgcatcaa gctctatgcc tgctgcggac ttgtccatgg tccagtcgaa gctatcgaaa 21300 
accttcagag gaggtacccc gagctcttga atagagccaa cctcagcaac attcgccacg 21360 
ttcatgtaca gctttcaaca gcctcgaaca gtcactgtgg atggatacca gaggagagac 21420 
ccatcagttc aatcgcaggg cagatgagtg tcgcatacat cctcgccgtc cagrtggtcg 21480 
accagcaatg tcttctggcc cagttttccg agtttgatga caacttggag aggccagaag 21540 
tgtgggatct ggccaggaag gttactccat ctcatagcga agagtttgat caagacggca 21600 
actgtctcag tgcgggtcgc gtgaggattg agttcaacga tggctcttct gttacggaaa 21660 
ctgtcgagaa gcctcttgga gtcaaagagc ccatgccaaa cgaacggatt cttcacaaat 21720 
accgaaccct tgctggtagc gtgacggacg aaacccgggt gaaagagatt gaggatcttg 21780 
tcctcagcct ggacaggctc accgacatta gcccattgct ggagctgctt aattgtcccg 21840 
taaaatcgcc actggtataa atgggagcga tttcatgcca cgggcacaaa tcctagggca 21900 
tatcgtacct gtatgatgga agcaccagcg gtttagcaga tagatgatag gttccttctg 21960 
ctctgcgttg cgttttgaat ttagttactt cgctggctta agaatttaga atgaaatgca 22020 
gtctctctta ttccttatta aactcacgta ctcccacatt cggcgactgg aggatacgaa 22080 
agcagtgttg gtgatgtttc ctgtaatgga tatcattttg ctgactgaat tattctatga 22140 
cctttccctc caacggcgtt cttatctcga cactttagat gttgacgctg ccttgaggaa 22200 
ctagctttgc gctgcgaagg ctatgagcag tggagctgca tcctttcgcc tagatatcca 22260 
ttctgcatag atccaaggca gggcttcgta agaaaagttc acgttcactg taagtccatg 22320 
caagcggaac ggccgcttaa acaagtctat acagtaaagc ctgcctataa gcaaccgccc 22380 
atataaggaa tcccgcgata ttagcatcta aaatccgcgt ctgaattgat tttctatata 22440 
aataagcagt aaactgcttg aaaaagccct gctctcctat acaaagctac cttaattaga 22500 
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aaatataggt tgactagcta aaaatgtgcc itacaatatc qtattattat ataatactta 22560 
tatgaccacc ggaggtacgc tagaaatata tatcgtaaac agatcacccc itagtaaaaa 22620 
tatatattcg tatagacctg gctqtaagca aittcttat: ataagtaact tttmggtgag 22680 
ctgaattcgt tgcttatagc caggtttgct gtaattgata aaaggtgcca attcatcata 22740 
atctatcccg catcggatga attgttgacg atccacacca taaactgcat tatgttctac 22800 
attttcctca ttggtatcta ttgggtaggc aggttgaaag ccccrtctgc ccacctttgc 22860 
atatttatcc cccaccgtgc gacggccgcc gtgatcaacc cagctatagt cgaagcaacc 22920 
acacctgcaa caactgcgag catgtacatt gagaggtaac aagaaaggta ccacacatag 22980 
cgtgaacgtc catccaaczc cgagggcgtc tatgacgggc acaacgagcg cactactccc 23040 
tgcagaaaag gagtatcgaa tcatatactt tcctgcaacg actgcagacc ggtcccgtgg 23100 
caaggcttct acttgcggtc agagacaaca ttattgtggg catgcacaca aaggggataa 23160 
attaatgagg gtgtggaaca aaccagccac gtaagtgttc aggcagttaa aaccgcccat 23220 
gagcccccag cccgcgaaga acgccgcgat tatgggcact accatcccac ccttatcctc 23280 
ttggagtgtc cacccgtaaa tgagcgttcc cgcaggcagc acggcaaaca atgtgatgag 23340 
cccgctgtgg agtcgatcct gagggagacg gaatccgcgc tttactatgt atctccgaac 23400 
ggtgcgatcc gaaagtttac cgccgacgag actccctatc aggaacccgg cacctggagc 23460 
gaggtagaag agacccgata ctagggcagt cgttaaatga aaccgtgagt tgaatatagc 23520 
acgagctgaa gtcaggatcg aatattgcgt aatcgccagg aggccacagc ataagtcctt 23580 
ttgtttgcgg catgagcatt tcgtttccag tagatgcaga gggcatatct cccaggcact 23640 
tacggcaaga aagacacttg gatacaccca ctgcttgagc acatccgttg gggagaattt 23700 
cgatatgatt gaaacaagtg tggtcggttt aaacgccgtt gagaccttct cagaagttcc 23760 
ttcgattttc gggaaaaata gcagggaaag cacgagcccc agtccgctca tacctagttg 23820 
aagccagaag ataacacgcc aactcgtgaa agtgacgatg acccctccca cgcaggg'gcc 23880 
tgatcaaaca gtcagttcct gttgggagtt ctagtacttt cagcagggta cgtacctatt 23940 
gcagggccag aaagagtccc ggccatgaag aaacctacgg ccgtcccacg gtaaacctgc 24000 
gcacgaacgg attagttttc gcaattgggg agacaagggc gtgtgatgcg tacaggctca 24060 
aagatatctg caagaacagt ttggcctgag accatgaacg aggttccggt taagccgctc 24120 
aatactctga acgctatgaa cattttctcg tttatcgccg ctgccgttcc agcggagcac 24180 
gcacaaa.gca ttgaaatggc cagattgtat gatgtccgcc tgccgactaa cttgttcatg 24240 
ggaccccata tgagggatga atatcccatg gcaaccaaga caccagcatt ggagatattg 24300 
atagtctcga cagtcatatc aaattcattc gcgatttcag gggcggcagg aagaagacaa 24360 
gtactggaga aagtaacgac tagagtcatc caactaacaa caaacgtaat gacacatttt 24420 
ctccataatg gaatatcacg gccctttttc tctggctcgt tttgtccgct accttccgag 24480 
gtcgtcgctg ggttcgggga ctcagtgtca ccgcggccca tttctgcaat ggatggcctt 24540 
ctcgtcactg agcggactct tcaaaagaaa atatagaaca tggtcaccca aaacatggat 24600 
gttcctgatt gacggtttat gtatactcta tcatcggcct gccacacatt tcgccgagac 24660 
tacgttgata gtgtattgga gcgccgctag accttcggcc ttgctggccg agacgggtac 24720 
ctaataaggt agattgtgtc tttccgcaaa ggtattcatg tggatcagtc tagaatggct 24780 
gcaaggccca ctgacttgac taatgcaagc caacgccagg atagttatat aaaatccaat 24840 
gagggatcaa catcgggatc gacatcggga ccagcacccg gcatccacaa ggaccggatc 24900 
cgaaaagtac ggccggccga aaggaaaccc atggaaggtg catagacctt cactgtacaa 24960 
tacagtactc tgtggacaat gatattggcc accatctcac ccgtccgata atccatgtca 25020 
ttgatgactg tatttcaaaa tagattggat atctgagaag tgatgaagtc tatagtatgc 25080 
tctcttctgt atgagggatt ttgttggttc aacagggttt atcattagtt tgtcactatc 25140 
tggccttcgc cgtacaggaa tgcaccgcat gtgtttgtac tctctgtgac tccctcgaga 25200 
ctaaggacat acggtacgta tattggctaa ccaccattat taaagaagcg gtggccgagt 25260 
gagtacctta ataaggcttg tatgccctct tagtagttgt cggataattt ctccaataat 25320 
agggtggata caggggtaac ggcggaacgg aaagaactcc gccgtcggcc tgcggcgcgc 25380 
ggcccacctg cacaaccagg ctgaattctc ttgaattctc tttttgatac cggaaggcgt 25440 
gagaaggagg ggaaattcat catcttaagt gctcatctta tatctgcttg acgaatgcag 25500 
attgcccagt gcgctcaccg gagcccgagc aaaggggcta gagcgggtct gggttacata 25560 
acgacagtgc gacaccctct tcatgtaaag atctcgcctt attcactttc cacattttga 25620 
acatccgcaa ctgtgccgta cccagtctcg tgaagccggt aaaatggcta aattcgcagt 25680 
aaaatgcctc gatcacttgg tcctgactgt gcggtctatt ccaagaacga ctgcatttta 25740 
taccaagcat cttggtatgc gacatgagac gttcacctcg ccacttaatc gaactatcca 25800 
gaggtattcg cgcggaagtt gtgatctaga ttgtcgcgat tcggcttact gaacagtcaa 25860 
actaggcatg cgctcatctt tggttctcag aagatcaacc tccatgagca tactaaggaa 25920 
tttgaaccaa aagcacgtaa tgtgcaaccg gggagtgcag atctgtgttt cttaacagat 25980 
acagacgtct gccaagtgct caaggcattt agggacgccc agattgaggt atgtgtagca 26040 
aatgcttcta aaagcgcaag cactgaaggt taggtgcagg tactagaaga ttccaaagtt 26100 
gtggatagaa caggggcaca aggcaagatc cggagtgttt atgtgcggga ccctgatgga 26160 
aatctagttg agtaagtgtt cttgttattt aacagatttc ccttgcttct aacatgtgtt 26220 
aagagtatca aactatgtct catctagcca ggctggcggt caataagctt tacaaggtta 26280 
tatgacccgt aactgagttg cgcctgtacc gtgattagag acaacattca ttgttccatg 26340 
ttggcctcgt tctgtcatca gttggttaca cactggctta taggcaacat ttcgcgtaga 26400 
tttaggtagt tatttgctcc gcagttccgc ttgtaatcaa agaaaaccgg gttctggaga .26460 
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atctgtttca t ttcaggcct cggcctggac agigacacct gaggttctga tgggatatgc 26520 
tgggtttagt ctcc5cc:i5 aacatagtcc acctctcaat ggcgtattgg ataatatcga 26580 
aggggatctg agttgctagt ccaacztaca cczaaaqtaq tcttgcaggc catataatat 26640 
tcgacataac tataccaccc cagtgatgga aatccacaac ttatgatact tccgaatgaa 26700 
cgtgtgtctt tcgtgtagat aagtccagtc cataaaatcc aatatacctc aataaaagct 26760 
tcgtaaatca tccttgcacc agcaccctcc ccccgccccc tttgtttccc acgccttacc 26820 
ggcatactca tgtaacctac aactcctttg tccgctcctc actctctgca atccacczzc 26880 
tcaaactggt ctcatccccc ggcttaaaac caaccttgaa tggcttcggt tccgttgtca 26940 
tcgatgtcgg agtagcgttg cactcaatag catcaagcac tggcatatcg gccgggtttt 27000 
gagctggctc tatggtaaac gccacgataa accgaagaaa gaccgtgtat agctcgcggc 27060 
tcgcgaggtg ggaggctgcg cacatgcgcg tcccggcgcc gaagctgtag tgaggggtgc 27120 
cgaagccttc gctcggctcg aggtatcgct ccgggagaaa gcggttgggc atgtcgaaat 27180 
gatcttcgtc gtagtttgct gcccacgcat tctgacagat caattagtcc aggaatgacg 27240 
ggataggacg ccgggggtta ggggttaggg ggattgattt taccatgaaa aaggtcgttc 27300 
cagcagggat cctcgcgcca ttatagatga ccucct tgat attcacacgc ggaatgcaga 27360 
taggcattac ggtccagaag cgcagggtct ctttcacaag tgcagtgata tagggcactt 27420 
tttcctccac tagacaccgc tcccaggcgt cgccattcgg atacacggac ataatttctt 27480 
cgtaagcctt ctgctgaatg cgttggccgt cttccgaaga caggtacgcg atgcccatga 27540 
tcaagttgcc aggaacggta tcaagtcccg cagaaaccat ggtcagacag atagacttga.. 27600 
tttccgctga gaaaggttag tatatataat cacaaatcat atagtggtgc atcgctcgtg 27660 
aaatagacat accatctgta agcttcgtct cgggattctt cagaatattc cccgtaatgc 27720 
acggcttatc tgtccccttg gccatgcgat ccttcaaaat atcaaacaaa aaggccatgt 27780 
acttatctct ccgcgcacgg agatgcttcg cctgattgct tctgttcgag aacaacctga 27840 
gcagaggaac gtaatcctgc cagttgttgc tggttgaccg gagattggcc acgccgcgct 27900 
gcacctcgca gatctctctg agaagctggt cgttcacatt gccctcgatg cggtagccat 27960 
agttcagagt taaactggtg ttgagcgcaa accgctggaa gtagggagta gggttgatat 28020 
ctatcttccc gccctgtgaa tctttcagca actccttaat actggccata ctctccagat 28080 
cgattatggg catataggac tgcaccgcga cacggttgag cgctgtcgca gccgccttac 28140 
ggcgccgttt acatgactca tcccacggtg acgttccgat cgtaaagccc tgtgagctag 28200 
agacaacacc atgaaatgtg tgaaatgtag ggcgcgatat catcgaagac tgctccttaa 28260 
tccacagctg gcgggtggac tcgaaggtgt tggcgaagat gacgcgccta ttgcccagac 28320 
gcgcctggaa cacaggcccg aattcctttg accatttgcg agcgactgtg gcgtgtttga 28380 
cacccagttg tatcagatta ccgaagatgg gcacgcctgg gatctcgggg attcccttaa 28440 
tcttagggat gtcggttcgg ttgaagtagc gggtgaggaa gtagataacc gcggtagcgg 28500 
cgatgacgat gatttggaga gtcatagtgg cgagatgcga atggtattga agaatagata 28560 
gataattctt ttactcaggt tggcatggat tgtggcccgg gctttatact tcaaccctct 28620 
atcgacatca ttccttaaag accaggatat tccgtcatag taacggcgat agtgacgcgg 28680 
ccgcgttctg tcagcccgcg atcagctctg tcatgtggcc aatattctga tctacattgg 28740 
tttcagactg atggtctggc tcgaatcgaa gcttcaaagg gctctcacaa cgctgcgttt 28800 
ccgattatcg gtcctatctt ggtgcctacg gacacggcgg ccggcgacga tcgagcggac 28860 
ggccggcacc cggcgatgga tctgatctgc cgcacatctt aatgataggt attaatggac 28920 
ggctgatcta accttataag ctataacact tatggagagt cgaatgcaga gttgaaggag 28980 
acgttggaag acaaacactg tagtatggcc gcctctaact agttcactga caagggtgct 29040 
gtaacaggca tagactgtta ggacggatca acccgaccta agctgaccca ataccaagtc 29100 
tacggggtac tttctgtaaa gaggctggag accgcgacgt tcaattattc caatctgttt 29160 
ccaccactat cttatatgta taagttgtct tccctcggtt aacttgctct tcatgttaca 29220 
tcttgtcact aataggcatt tgatgtttga tttggctatt gactattgac agaacctcct 29280 
atgaatttcg cctttcagtt gccgtcggtg gttgctgtcg ggccgactac ttgttgccgc 29340 
tggtgattga atgctactta tatatatcca tgtctttttg tcggctttta tcacgccact 29400 
gccgccggat ttatccggta gaccctccta tcctgtcccc aaagcggggg aatgcgtcaa 29460 
gatcttcggg gatgaaattt ccccgcatcc agctccctac ctatactaca cagttaccaa 29520 
caattggcaa taaatagaca aacataacga acaaaaggca gctagaccgg atattaccgg 29580 
gtaaacgggt ttgtaaattg gacatcctgg ctctctccag agctacttaa ggattgtctg 29640 
ttggagaagc caaatgagaa agaggtgctt gtattagctt ccgaagcgcg tttctcgata 29700 
cctacctagt caatgcgcct ttcaaattgg ggctacatgc ttcttgaggg ttttatcagg 29760 
accaacttcg atctgtccgt tgcaagaacc aagaaacctt tccgccaaca aagcaaaact 29820 
tgaccatgaa gcccgcaatc cttatgaaat actggctctt cgtctcagct gtgagcgcgt 29880 
caaccctgaa cggcaagctc acattgagtg agacaaaggt gacgggggcc gttcagctgg 29940 
cttgtaccaa tagtccaccg gacatctata tcgaccccga tgattcggtc tcagtggttc 30000 
gcgcagccca cgatctggcc ctggactttg ggcgcgtctt tggtaaaaat gccacagttc 30060 
gcttcactaa cgagactcat ccaacatcga tggccatcat cgctggtacc atagataagt 30120 
caaccttcct tcagaggttg atagcggatc. ataagctcga cgttaccagc atccgtggcc 30180 
agtgggaatc ctattcatca gcactggtgt tgggtccagc caaaggcata cagaatgcgc 30240 
tagtcatagc tggcagtgac cgtcgtgggg ccatctatgg cttatacgat atatctgaac 30300 
aaattggcgt ctcgccattg ttctggtgga cggatgttac cccaaccaaa cttgatgcca 30360 
tctacgcgct agatgttcag aaagtccagg gtccaccgtc agtgaagtat cgtggaattt 30420 
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tcatcaacga 
acaacgggga 
gcctgaaagc 
ccaacaa tgg 
tgacggttgg 
gagcaacaaa 
t caatgttaa 
acacaatggg 
aagaaagca t 
tgtcgaacgt 
Latcgtttgc 
atgactgtac 
cgtctccca t 
gacatgaacg 
tgggatcaac 
ggggata t 



cgaagcgccc 
ccctgcct tc 
gaattacctc 
cccact agcg 
gactccctgc 
cgagcaa tec 
agcat t tatg 
ca tgcggggt 
tgttagctgg 
ggt tcaacca 
tgactgccag 
cagaccaggt 
tgcagaatga 
cgccgccgcg 
rgcgcgcggc 



gect tgcata 
atetcaegtt 
tggceggega 
gactactacg 
tcgaaagccc 
cagt t tctaa 
agggagggtg 
etaggega tg 
caggaa tccg 
t 1 tg tcctat 
gtctgtgaca 
caca t tga ta 
aactgggcgt 
ctgt tacaag 
a tacagccac 



actgga ttct 
tctacgccca 
tgt ggt caaa 
gagtggtaa t 
at gctgacta 
aegggaegtg 
taa 1 t aqgag 
ctgcatcccc 
t getategga 
t tgatgttag 
caggaactgg 
ta tcctga tg 
tctgggggcg 
tggatcaaca 
ggtgctcaga 



tgcaaat tat 
tgtc:i cgag 
ta tgt 1 1 ta t 
gggcact age 
cgaaaaagaa 
ggactgga 1 1 
ccaacacigg 
gacact taac 
ca tcctgaa t 
gatcca t tea 
gaacttacta 
acaatgeagg 
caggaatt ta 
cagctcaact 
cagtatgggt 



ggcgaggt tg 
ccga - ectge 
gt tea tgaca 
cacactggta 
ccgazggctc 
agcaacgagg 
gagaccgea t 
gcaacagtgg 
aaaaccaacc 
ccct caaata 
tgagagegge 
caata tgctg 
ctat cattt t 
ga tcaggacc 
tgccaa tat t 



2048: 

J . 

206C : 
306c: 
"* ^ "7 " " 

5 076: 
306 4: 
209C: 
3096: 

2102:- 
3ios: 
2114: 
3iic: 

31262 
3132: 
2132E 



<210> 20 
<211> 5053 
<212> DNA 

<213> Aspergillus terreus 



<400> 20 

atggcgtctc 

gecacaggag 

gaegctgetg 

a tcctccctc 

cgcaaacctc 

aagtccaacg 

a tagacagac 

gtgataaege 

ctgatcaacg 

cgcatgcgct 

gtcctgtcga 

aacgegcagt 

ctggccgata 

aacttcgccg 

tataacgega 

atctcatcca 

aegcagcaag 

gcgttgt tta 

ctgcactttt 

aaaga taagg 

gagcaggagt 

gtgeat ttta 

ctgatttatg 

gaegggacca 

ccatccttca 

cgccttgctg 

gaegtcacat 

ggcaageget 

aacggcgacg 

accaact ccc 

caggggttct 

gggacatccg 

t t tggagagc 

tttgggccgc 

aaaggtatgg 

acaggtattg 

caaccggcca 

cgagcttctg 

ttgaccggcg 

ggaeggatec 

ggagttt tga 

gaggect tgg 



tacttttctt 
cagccgtccc 
aatggcgtcc 
ttggcgcatc 
tccgtgacaa 
gtgacatggt 
caattgacat 
aagtgcaaac 
ccggcaccaa 
ccctga tcga 
ccctgatccc 
tccgcgagct 
tgga tccgcc 
ataacaagca 
tctacaacgc 

eggggacctg 

ggagtggtga 

ctgtccgcgc 

tctt egggag 

ac tccggcgc 

t tacgaaggg 

tcgacatcaa 

tgtgataaat 

cctacgcaag 

ccgatategg 

atatcgaegg 

gctggcgaaa 

teaeggggaa 

ggegegaega 

ggagctgea t 

accaagatgc 

ga ttacggga 

tegggagaca 

tgtattacgt 

aaggaagttc 

caatatgatg 

tatgegtett 

gggggcgaat 

egatctgea t 

cgacaatctg 

ct gggactac 

1 1 tct ttgac 



tacggtgttc 

tg tetcgega 

aagagcaaaa 

catcacctgg 

acttegcttt 

agacaatgta 

a taaacgegg 

cgcggccgca 

cgactgcgac 

aaccctaatc 

ctcgggttcc 

ggtccttgac 

ggctcccagc 

ccccaacgac 

tgcggtggcg 

tgacaaagag 

gga tgacgga 

egggaagggt 

get t tatact 

ggtgacgttt 

ggggacgttc 

eggtaagcac 

gataggegae 

catcaacaat 

actatggaag 

cgacggccgc 

tggatggatc 

agtcatggga 

ctggatgtgg 

caaaggagag 

taactctggc 

tcaggtctac 

ggactatgtg 

tcatgtgtgg 

tcatagagag 

ggccacgaca 

tatccgaata 

gagattatct 

ctcgcagact 

aacagggccc 

aatatcaatg 

cggccggtcc 



aatctaacac 

cccacagacg 

cgagatgatg 

ggatacctat 

gaaggctggg 

tgcatctctc 

gaaaagga tg 

aactcgctcg 

tacaaegteg 

ggcgccccgg 

acaaccctcg 

atgegegagg 

cccggaaaca 

tacgggtact 

gagctcattg 

tacgggagcg 

atctatcgac 

gcagccgatc 

agggegta tg 

gtt tct taca 

tcgactca ta 

tgtgtctgtc 

ggact tgatg 

ggagacggcg 

agt cccgaag 

gccgactact 

gaagatatcc 

gacctgcgcg 

gttgatgacg 

tctggtgacg 

ccgtcgcatc 

tttgegegae 

t tcatcaaga 

aagagcaagg 

gttgattget 

atggtatga t 

ggggcctggt 

tcgaccccca 

gggaeggega 

aagtttggcg 

ctgcagatga 

a t tt tget ga 



tggctcttct 
attcgagata 
ccctgaaagt 
cctcaaccgg 
aggtggaca t 
t tcccccacc 
t agaagecca 
cctacaagcc 
accctgcgaa 
aca tggccaa 
aagc taacag 
cgcagaatgt 
actggatcac 
cccaga tggc 
tcaagccggc 
gagtctaege 
aegacagega 
catacaagga 
atgaca tga t 
cgaataatg t 
ataat tgtaa 
tgccgaggaa 
actacatctg 
aegecaagag 
gatacgatca 
gcggtttggc 
ccgcatactg 
gcgtgcga tt 
a tggegctae 
ggttgaacgt 
ccggaatggg 
tcta tggega 
aggatacctc 
gegcaggagg 
aattgtta ta 
ggactacat t 
tgaagtcccc 
agagcagat t 
cggagcctgc 
gaacaaga tc 
gctt tactgc 
tgtt tctggc 



atcatctact 

tatagacttt 

ccctctacgg 

aaa tgga tat 

ggtgggcagt 

cctccgacag 

cagcggcgac 

gaacgtcgtg 

cgccggcgag 

cacgctcatc 

gccctccgtc 

ctcca tegtc 

gtaccccgat 

agacatctgg 

ggacct tgac 

tggcgggttc 

gtatagcggg 

t gacgacgag 

gatctrccac 

ccaca c tgag 

cccggggggt 

cca tctggga 

catcgccttg 

caacaagcct 

ggcacatgta 

tgacaaegge 

gcagccgctg 

cgaggatatc 

gacaacatac 

cgtgtggcgc 

agtaa tattc 

ggtggcggat 

tgacaagtat 

ggctaagatc 

gecgaeggag 

tgga tccatt 

gecgaegggt 

ggcatgaagc 

gatataatct 

aaagacaegg 

cccgagcacc 

aacggcaagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

102C 

108C 

1 14C 

12CC 

12 6C 

132C 

138C 

144C 

1500 

1560 

1620 

1680 

174C 

180C 

18 6C 

192C 

198C 

204C 

210C 

216C 

2220 

228C 

2340 

240C 

2460 

2520 
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ccgattatct gtgcgttgag aaggaccgcc gcacctgggg ctgggtcaat ggggacgacc 2580 
gatgggacta cattgatcaa -tcaagtact ccgaggagaa ggacagggcg aatczacacz 2640 
gggccgacgc caacggcgac ggaaaggccg atatgatctg gacagacaag ttcccgggag 2700 
atgggtcggt gtggcacaac cttggccaac gtgatatcaa gggatcgcga tacgaatggg 2760 
gaccgcaggg tcccaagtac cgaggggcgg ttgaaggctc atgcacttat ttccctgatc 2820 
tgaacggcga cggtcgtgca gacangcaca gcatctggaa ctccataaac aacacagcgc 2880 
agacgtggta caacgaatgt gccaccaaag accacacagg cgatgacggc ccgataacta 2940 
accccaatct acctgtatct cctqtaaaag cccccatcga gctcacccct cattatcagg 3000 
acaacagcga gtgcactagg gcccaggtgc agacgctctt tgaagaaatg caatatgccc 3060 
ttgatgctgc ctcggaagtt gcgtacttta gcggcggcgc atacgaccca tataggqaca 3120 
tcttctttgc cgaatcactc accgacagct tgaccttcac tataaatgta aggtatacgt 3180 
tcgaccggat ggtcaccatg atttctgggt cttcgcaatt cgacgacgaa aagttcacaa 3240 
tcacttgcaa aaaccttcgg ggctgtgacg agaacggctg gttggccatg atgaacaata 3300 
ggaatcggct taatttctgc ccaaagttct tcacagatga gttgaagagt tccaggtcag 3360 
tgctcgcgag gtgcgactca attaatcttc atgatgccca tctcactcga gccggggcga 3420 
ttttgcacga agtaacgcat acggactatg ttatggagat tgtcaatgga gagaatgggt 3480 
ggtatcacat attttttctc attcccaggc cgatactgat tcttfcacct taggaggacc 3540 
cgcgattatg tctatggatg gaaaggagct cgagatcttg ccgcagggac cttcaatcga 3600 
cactgtatcg aaagaggcag aaaggctgaa agagccgcta atgagctccg tatagccggc 3660 
gatgctaact ggcaacgcag attgctttgc ccaqacccaa ataacctcgg gcaagaaggc 3720 
atctgtgaca gcaagttgtc cgcctacaat gcggattcat gggctcttgt cgtacttggc 3780 
gggtactata ccaagatatg tggtcgacag attccccttc ctgaggagtc tgcttcttcg 3840 
gcggatgact ccagctgtcc ggcctacgat gattcgtctt atgatgctga cactgtgtac 3900 
ggcgtcaacg attatgttca cttcggtgac tcctacgccg ctgggatggg tacaggaacc 3960 
acaaccggtg acagttgccg cgtgggaagt aacagctacg gaaagctcgt ccaggagtgg 4020 
tttgatactg aggatttcac ttataccaac tatgcgtgct ctggagatac aacggttgga 4080 
ctgaataaaa agatcgacca gtggctagga caggacccca cggggactac catggcaacc 4140 
ctgacaattg gagggaacga tgtgttcttc agcgatctgg tttccaactg cgtgctaaca 4200 
atgtggtggt actcgcttga gcaataccgc cagtggtgtc tggagactga agagaaagcc 4260 
cgcaacctga tgcaggatac agggtctgac ggactcggct cgaaacttag ggctgcgtat 4320 
gaaaagatcc tggatagatc tggctctagc gtatatctcc ctgttatcct tatttattcc 4380 
tgtcgtgctg tccttcgtcg cgctgacttt actttagtcg ttcaacctct acgtccctgg 4440 
ctatgtcacc ttcttcaacg aagacaccac cgactgcgac tcaaccacct tctggtacga 4500 
aagcccacac tacgacccgc agcaatccgg caactatgtg tggctcacga ccgacctacg 4560 
caaggaactc aacgacctcg tccgcatgct caactcgtta atccaatcca ccatttccga 4620 
catcaacacc gcccggaata cggagcagat ccattacatc gatatggacg cgcgatttga 4680 
cggccaccgc tggtgcgagc ccggaaccca agaaccagac cccgacaacc caaacactta 4740 
cttcttccta tccgcatggc ccgatatcgc gattgttgga gacacgacag ccgagagcac 4800 
gaacgcgacc gagacagacg aaattaccgc gcttatgaac tccggatcga tccagctgcc 4860 
cgatgcggat acgtgccagg atgcgctggg atctgacccg gatccctaitg cggttttcat 4920 
gtgtgacgtt gcggtccacg tcaaggcgaa ctcgtcgagt ttgatcgcgc agagcttgga 4980 
ccgagcgaat caggccattg ccaataggga ctatagtagc caggatgtct cttggtggtt 5040 
gcctagtccc tag ' 5053 



<210> 21 
<211> 987 
<212> DNA 

<213> Aspergillus terreus 
<400> 21 

atgactctac caacacttcc taactggata 
ctccatcagc accgttcccc gaaatacgag 
tcacacagaa tcctcatcat cctaaccaca 
caacttcgaa actccacgca cggcatctcc 
gcaacagaac acttcaccat cctattcgca 
atccatgagc cccccacgac cggcgacggg 
atgggatgct tagtcctctt ctgccaagca 
aaactcatcc tactaaccat atacattcaa 
atcgacgcaa tcaccactcc cgaggaaacg 
aactggctga tcggactctt tctttccgcg 
gtgctccgca tcgcgggatt catagatcag 
cagccatcgg tcttaagcct gacaggcctg 
tctggactct gggtactcag ggttcagcag 
gtggattgga tgtattggta ccatgtaatt 
gcgctgggac aatgggtttt gttttggtat 



aggatgtgcg tgcatttgtc ccttacacat 60 
tctataccta ttaaaagtat ccaggctaat 120 
gcctccttct acccgcagat ccggtgcatc 180 
actgcctaca tcctct tcaa cctaatcagc 240 
ttgctggtaa acagcggcgg agatgtcctc 300 
ttgaacctgt accagctttt cgcagtgtgg 360 
atccatagcc tccacgccaa tccacgccgc 420 
tacctatgca tttctatctt accagaggtc 480 
agaaaacaaa ggccgccaac gggcgagagg 540 
cacgcgatga ccgtcctgcc actatcggcc 600 
tcgcgactga tctcgcggcg cagacgggag 660 
gcgtgtcagg ccgtggtctt tgctctagtt 720 
cctgttcctc gaatgccgat gagaagacct 780 
gggtggccgg ttgtcgacga tgcggtttat 840 
gcggtttgtt ggcgttctcg gggcgatgct 900 
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agggatgaag cagtccatgc tcgggagact gatgacctct taggagagga tgaagggcat 960 
gggtacggcg gaaccgggac tccttaq 987 



<210> 22 
<211> 972 
<212> DNA 

<213> Aspergillus terreus 



<400> 22 

atggtcggga 

gtgacgatgg 

gtaca ttggc 

ct tctcgatc 

a ttgagccgc 

tggtacgatg 

gacctgctga 

cgtgatgaga 

ctcaaactgc 

accatgcgca 

ggagagctgg 

gcaaaatacg 

gcccaactgg 

cccggccggt 

tacgactgga 

cgcattcatc 

ggttcactat 



gcaagttagc 
caattcaagc 
t cgagcccca 
ccatta tcca 
ctcgctacgt 
cagccggggc 
tcggtgggt t 
tccggacgg t 
tgga tagt tg 
gcta tgcat t 
tggcggtagc 
atccttaccg 
agaacaccaa 
tct ttgcctc 
aggtggtccc 
ccaccacgaa 
ag 



ccataatgag 
gcgccagctg 
gggagccaaa 
ggagcgacgt 
agactcga tc 
gcaactggcc 
ggtggacatc 
catcggccaa 
tctcaaggag 
gcagga tgtg 
tgccgaccgc 
gtatatgcgc 
cggggaccac 
taaggaga tc 
cgacgaaccg 
gctcatgatg 



gagtggct tg 
cgcctctggc 
c tccgggcgc 
gcggaaagag 
cagtggttcg 
atggactt tg 
gtccgaca tc 
gggggttgga 
tcacagcgcg 
actttctcca 
atgagcaacc 
ctgcgagagg 
atcggcttcg 
aagatgatgt 
t tgcagtact 
cgccggcgcg 



aca tcgccaa 
ccgtcattct 
aggt tcgacg 
atgcctgccg 
aggatactgc 
ctggtatc ta 
cccatctcct 
cacctgcctc 
tcaagcccgt 
atggaacct t 
ccgaggtctg 
acccggctaa 
gttggcatcc 
tagcctactt 
accgccattc 
acgaggata t 



gcaccacgcg 
gcgccccct t 
agcccggcaa 
ggcaaagggc 
caaggggaaa 
cggaacctcc 
tgagcccctc 
gctgtacaag 
cgaatgtgcc 
tatcccaaaa 
gccagagccg 
agcgttcagt 
acgggcttgc 
gctcatacga 
t ttcagcgtg 
ccgccttcct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

972 



<210> 23 
<211> 771 
<212> DNA 

<213> Aspergillus terreus 
<400> 23 

atgcgttacc aagcatctcc agcgctggtg 
ggggctggct gctctcccgc catcttccgc 
cgcgaaaact ttgaattcgt ctacgtgaca 
attctccccg tcttcgccga cctagggcca 
aacaatcata atggaccctc cgtgagcgaa 
cgcaccattg tcgactggca gactcaacac 
ttctccgaag gtgccctggt gacgaccttg 
ccctggttgc cccggatgag tgttgcgctg 
agccagtata tgaggaacga agtgatgaag 
accgagtggc aggaggaact ggtcattcgg 
gattttgcgc tcgcaggatc gaagatgctg 
caggtattgg agtttgctgg gcagcatcag 
gtcattaatc gttttcgtaa gctgtgtgtg 



aaggcgcctc gagcgcttct ttgcatccat 60 
gtgcaattgt ctaagctccg ggctgcgctg 120 
gctccgttcc cttcctctgc agggcctggg 180 
tattactcct ggtttgaaag cagcagcgac 240 
cgcctcgccg ccgtccacga ccccatccgc 300 
ccccacatcc ctatcgtggg tgctatcggt 360 
ctcctctggc agcagcagat gggtcacctg 420 
ttgatctgtc cctggtatca agacgaggca 480 
aaccatgacg acgacaacga cagcaaagat 540 
ataccgacat tacatctgca gggtcgcgat 600 
gtggcgcgcc atttctcccc ccgagaggcg 660 
tttcccaatc gaccgcgcga cgtgttggag 720 
acggcccaga cattggagta g 771 



<210> 24 
<211> 1253 
<212> DNA 

<213> Aspergillus terreus 
<400> 24 

atgggcgacc agccattcat tccaccaccg 
gatgaagtca ccgtctggaa tgccgcaccc 
gtccgcgtcg aggccgtggc gatcaatccc 
acgccctggg cgtttctcgg aacggactat 
gtgactcata tccaagtggg tgaccgggtc 
accccggatc agggggcatt ctcgcagtac 
atccccaagg gcttgtcgtt cgagcaggct 
ggattggcga tgaagttgct tgggctgcct 
acccactcca agccggtgta tgtgttggtc 
actatgcaaa tgctccgcct gtaatgcttc 



cagcaaacag cgctgacggt aaatgaccat 60 
tgccccatgc tgccccgcga ccaggtatac 120 
agtgacacga agatgcgcgg acagtttgcc 180 
gccggcacgg tcgtcgcagt gggttcggac 240 
tacggggcac agaacgagat gtgcccacgc 300 
acggtcacgc gaggccgtgt ttgggccaag 360 
gccgcgctac ctgcgggcat cagtaccgct 420 
ttgccatcgc cttcggcaga ccagccaccc 480 
tatgggggca gtacggccac tgccactgtc 540 
ccttgtcctg agacttttct ctccgttggt 600 
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cgtgggctc t 
t tgcaaca :c 
ttgactatcg 
acccctccca 
acaccaagaa 
tct get t cgc 
aacacgcggc 
gcgaggga t c 
teggegagga 
ateegt t gec 
ggaagggaga 



acaaccgatc 
ctccccccac 
ggccccgaat 
gt ccactttg 
caa tctccgc 
ageca t cage 
ca cgcgcaag 
aacct ggece 
t ctgt ggege 
eg tggtgcag 
gctg t egggg 



Gt tctfiCicc 

eat l tcgacc 
ctcgcgcaga 
gect tacaga 
tatgetcteg 
cgcgcggggg 
a tgg tcacga 
gcccccta tg 
a tegegggge 
ggegget teg 
gagaaactcg 



catccoctcc 
i gaccaaa z c 
cgatcctcag 
acagacta 1 t 
actgt atcac 
ggcactacgt 
ccgactggac 
ggcg t ccegg 
agetegtega 
atcacat taa 
tggt tegget 



cagct ccgga 
Gcgcggcgca 
tgaacccctg 
ga ta t tct tc 
caacg tcgag 
ct ccctgaac 
cct ggggccg 
cagtgaggaa 
aga tggaege 
gcaaggca tg 
egaggggecg 



tatcttCCaa 

gacgaggt c t 
ccaccgc t ct 
tagegtaect 
tccaccaca t 
ccgt t ccctg 
accatctttg 
gageggcagt 
ctegt cca tc 
gagctcgtcc 
taa 



660 

72C 

78C 

840 

900 

960 

1020 

1080 

1140 

1200 

1253 



<210> 25 
<211> 1394 
<212> DNA 

<213> Aspergillus terreus 
<400> 25 

atgggatcca teattgatge tgetgeggea 
ttcegcaagg ccgtgaaatc caggcagatc 
agtggtgaga gaccccaatc ggaccccttt 
acagegggae atacctaggc aatctaaatt 
gaegggaega gtgcaatcag ctgccgccgc 
gtgcgaccaa gctgctgacc acgatcatgg 
acttggatga gacggtggat aggctgette 
ggtttgacga cgegggaaac geaagattge 
acctgctgac gcatacatcg ggactgtcgt 
acatggccca gggccacctc cagteggcag 
cgccggccgt caacgaccct ggggcggagt 
gtaagctcgt cgagcgggcc accggcctcg 
gtgcgccgct gggcatcacc gacatgacct 
cgcgccgggc cgaccaaacg caccgcaact 
eggtctaett ccgggccgat ggagaggagt 
ctgggtccta tatgaaggtg cttcactcgc 
cacagaccgt ggacttgatg tttcagcctg 
accagcacat ggacgccagc ccacatatca 
gtegcagett tgggctgggg gggatcatcg 
gecgaaaagg ttccttgacc tttgggggtg 
ctagattcct ttggtttatg tcaccttcaa 
attagcaaat cgaccccaaa gccggcctgt 
ggaatgaccc ggtctgtcgt gatctgacac 
accagcaggg ttaa 



geggatcegg ttgttctgat ggaaaccgcc 60 
cccggggcgg tcatcatggc ccgagattgc 120 
gcgacaatta caagcacacc gagacgaatg 180 
atacgegctg ettegggget cggacggtgc 240 
tacaggtcga caccccctgc cggctcgcca 300 
ccctacaatg catggagege ggtctcgtgg 360 
eggatttgag cgcgatgccc gtgctggagg 420 
gagagegteg ggggaagatc acgctgcggc 480 
acgtcttcct ccatccgttg etcegggaat 540 
aaaagtttgg catccagagt cgcctggcgc 600 
ggatctaegg cgccaacctg gaetgggegg 660 
acctggagca gtacctgcag gagaatatct 720 
ttaagctgea gcaacggccg gatatgettg 780 
eggeggatgg gcgcctgcgc tacgacgact 840 
getteggegg ccagggggtg ttctcgggcc 900 
tgttgaagcg agaegggetc ctgctgcagc 960 
ccctcgagcc gcgactcgaa gagcagatga 1020 
actacggtgg gccgatgccc atggtccttc 1080 
ccttggagga tctggacgga gagaactggc 1140 
gcccaaacat tgtgtgggta agctcggctc 1200 
tgttgatcca gectactaag tgctattgta 1260 
gcacccttgc gttcttccaa ctggaaccct 1320 
gcacattcga gcatgccatc tatgegcagt 1380 

1394 



<210> 26 
<211> 3504 
<212> DNA 

<213> Aspergillus terreus 
<400> 26 

atggatccgg tggttagaaa gccggaccct 
ttgcgtgcca ttgtgggaca cgcgtgtcga 
ctgaccgcgg caacgaccca tcttcatgtg 
ggtctggccc cctgggccaa ggaaaccccc 
cgcactgtta gectgggaga ggctagegea 
gtgeeggagg atggccaggt atgatagatc 
cgtgctgacc ggaccgtccc tgtgctactc 
tcaccctcga tetacegggt gcgtctgtcg 
tctcagggtt cctcggtgcg gaacagacca 
atcattccgc gttgacgttt cgagttccct 
tcgaaattat accctcggaa aaagaggatg 
aaggaagtcc caggtcactg ggacactggc 
gtgttcacca tgeggagaca gtcgacttgg 
atatgactgt ggtctctctc tttcgggtga 
cagcctcggt cctgctgtct ggtgcctttg 



ggcggggtgfc agcatcgagt gaccaaagca 60 
catcccattc acactctgct agtcacggcg 120 
ctggaaggga catatcaggc tactcacaga 180 
ttgaaegtec agtcatttct ctggggaagc 240 
tggaaatggc agatagacga ccgacctaag 300 
tcctgcgctc cattggtccg agaaattctg 360 
tcaattagtc tgactttcac tgggctcttg 420 
aegecagtat ccccttccta tcaaacacgc 480 
cgcccacccc cgattcatcc ccctcacccg 540 
actcccaact agatggcttt ctacaggctg 600 
atagttggag actgaggtct cctcgcgaag 660 
teggaagetc atggctgtca ttccttcatc 720 
tgatcatagg gctcagttac etagecatga 780 
tgcgccacct cggctcacgc ttctggttgg 840 
cttttgtact cgggcttgga atcacgacta 900 
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ca t gcgacgt 
cagt gggcc t 
t g tggggegg 
accaa t tga t 
gatctcacct 
gt cz tggeca 
t taccttcta 
gagggct tgg 
cgacaaacau 
tccaattg tc 
cccctac tgc 
cggcgcgcgt 
tggaat cage 
atggattgga 
tcctcaacaa 
ccgccgtccc 
gt actcgccc 
eggacgaega 
agact ctcga 
cgcgtgccgt 
gcagcggcct 
gcgcctgctg 
tggtgatcga 
cgagcgccag 
aaggtgatgg 
aagcccagcg 
cgaccagccg 
aca t cegg tt 
t tgaaaaagc 
cct tatctgg 
gcggcaagtc 
agaccgacgt 
tggcgggcag 
tggccactgg 
aaaagtaggt 
gtgtgtttgc 
gccctcgatg 
gttggacttg 
ct tggcccgc 
tgcggcgggt 
caagaaatga 
ggtacagagt 
gacatccatt 
t tgtcatgta 



gcccgtcgac 
t gagaagecg 
ggggcacegg 
t cc taaca tc 
cct ggaaa tc 
tttttgette 
tgccacca 1 1 
t caag t taat 
cacctggtgg 
geect tct tc 
tgtcagtcct 
cgaggggt tt 
tgggt tcaa t 
aagcccgctg 
ccacctga tc 
tga tggctcc 
cccagaagaa 
gctggtggaa 
gcgga ttgcg 
geggattege 
ggeggagtea 
tgagaacgtg 
tggacaggcg 
tegeggatge 
tatgacgegt 
ctgggt tgag 
gt ttgegegg 
ccgcaccacc 
cctggaggcg 
caatt tctgt 
cgtcatcgcc 
cga tgcgctg 
cctgggcggc 
tcaggatccg 
agcttctcta 
taatggtggg 
gaggteggea 
ctaggtgtcc 
at tgtcgccg 
cacttggtca 
agggatcget 
aeggagcaat 
tcgtgcaaat 
act taggtag 



a tget tct tc 
a t ccaactaa 
caacz cccca 
a tccaactcg 
ggcgcgt tgg 
ctggcggca t 
ct 1 1 gcg t ga 
gccaagcatc 
aagctat tga 
ta tegggtea 
t tcaaagaag 
gagacacgeg 
atatcageca 
ggtcgacta t 
cacgctgctc 
tact tgtegg 
acagaggccc 
ctgtgt ct cc 
gcggga tea t 
cgtgccgctg 
ttgctccctt 
gtegggtace 
ttgttcattc 
aaagegatea 
ggtccctgtc 
tctcctctcg 
ctccaaaccc 
aegggegacg 
atggccgccg 
t ccgacaaga 
gaagecaega 
gtcgagctca 
ttcaacgccc 
gcgcagaatg 
cgttttgatt 
tttctagcat 
egattggegg 
ggggegcaca 
cggccgtgct 
atgeccatat 
gtgat tga t t 
tagaacaccg 
cgaa t a taaa 
gtaa 



1 1 1 tcgaagg 
cccg tgctq t 
a t gccgccag 
eggt tga teg 
ca tt aggggc 
ggacactcct 
a a t tagaga t 
ct tegggga t 
cggtgggcgg 
t gggagaata 
cggccaacgg 
taaccgttct 
etaaaegtte 
gtctcatggg 
gctggcatgc 
tgeca tgetc 
tcctcaaatc 
ggggtaaga t 
cccgctcggt 
tgtcgaaaac 
a tegegacta 
tgcctctgcc 
ccatggccac 
atgctggcgg 
tgcgattccc 
ggcacgaggt 
tgacggtggc 
cgatgggcat 
agggtggatt 
aatccgccgc 
t ccccgcgga 
acacggccaa 
atgcctccaa 
tggagagcag 
ttctcctccc 
cgatggaaac 
aggcaccatt 
t tecaeggag 
ggctggcgag 
gcaacacaat 
cteggggcag 
gtata tgtgg 
aa tacctacc 



aatcccgtac 
tetctgegtg 
cagtga tga t 
agaggggtgg 
ggtccttegg 
ga t tgatgee 
cacgcgaatc 
1 1 ttgggcac 
ettegttcta 
ta tggctaa t 
actcaacgag 
gccgccactg 
tacatttgac 
cgea t tggtc 
ttggccccaa 
tgccactgcc 
gaaccaagca 
cgeggggtae 
gacccggctg 
gccctccact 
taactacgag 
cctgggagtg 
aaccgagggc 
cggtgccact 
gtcggcccaa 
tctggcggcc 
ccaggcgggc 
gaatatgatt 
tcccgacatg 
cat taactgg 
gactgtccga 
gaacctggtc 
cctcgtccag 
tagttgeatt 
ggttatatat 
ctgcacatcg 
cttgaggccc 
cctggcgcca 
ttaagtacct 
cgcagtgcgg 
cttcaaagga 
taatct taga 
tacgtagaaa 



cccgitctga 
t eggaagaac 
ageeggcaaa 
tata ttgtgc 
ccaaaggata 
gttctacttt 
cggagcccag 
aaggtcaagt 
tgtcact tec 
ggtactctgc 
atctacctaa 
cagtaegtet 
ggtgtgctcg 
gt tagectgg 
gcgagagagt 
cctgaagt ct 
gaatctctga 
agt t tagaga 
gaggcattta 
cagaacctct 
ct tgtgcatg 
gccggaccca 
gtgctcgttg 
accatgctca 
cgtgcagctg 
gect tcaacg 
atctatctct 
tegaagggeg 
cataeggt ta 
atcggcggcc 
caggtcctga 
gggagtgcca 
gcggtgtttc 
acgaccatga 
a ttcacgtgg 
ctgtctcgat 
agggageca t 
atgcgcgccg 
gcgcggctct 
gtgccacagt 
cgctatctcc 
aca tgcggga 
agtacctacc 



96C 

1C2 : 
ice : 

11 hZ 

12:: 

12€Z 

132: 
i3E : 
14-. : 
is: : 
i5c: 
162: 
16= : 
174 : 
lec: 
i8c: 
192: 
1 9? : 
204: 
21c: 
2ie: 
222: 
225 : 
234: 

24:: 
24c : 

252: 
256 : 
264: 
27CC 

2i ec 

2820 
2860 
2940 
30C0 
3060 
3120 
3160 
32 4: 
33C: 
336: 
342: 
34c: 
35C4 



<210> 27 
<211> 1512 
<212> DNA 

<213> Aspergillus terreus 



<400> 27 

atggctgcag 

cgcaccggtg 

caaaagatca 

cagcaggctg 

tccagggcag 

ccagtgccct 

eggcaattte 

gaggctattg 

ttagagccaa 

t tgccaccgg 

gaectgtegg 

ccgaagcagg 

a tagcacagc 

ctagatccca 



atcaaggta t 
gaacat tacc 
aatgtactgg 
gacttcgatg 
eggatctegt 
cacagagctt 
ttgatccacc 
acactgactg 
cgctgccgga 
tatcgagega 
cggtgtcgca 
aaatctggac 
gecgacaaaa 
teggaa tgt t 



at t cacgaac 

ccgccgtgca 

aaa taaggag 

cgtctacagt 

ctctgctgac 

gccgctagac 

ggacagctac 

ctgggggctg 

tctacct teg 

cattgetegt 

ggaactggaa 

ccgtgcgtcg 

cgtatgggca 

t t tcaa tgcg 



tcggtcactc 
t tccgacgct 
gttactggcc 
gagegatgee 
ccagatccct 
gtatccgagt 
gactggtcgt 
tcccaatgtg 
cccttcgagt 
gcggccagtg 
gagatccttc 
cegcat tccc 
aactggctaa 
t cacgacggc 



tctcgccagt 
ct tgtgatcg 
gtgctccctg 
ccaagcgcaa 
get tgcacat 
cgcattcctc 
ggacctcgat 
atggaggctt 
etaeggt tga 
cgcaacgaga 
tggccgtgac 
caactgcttc 
cagacttgea 
ttcttactgt 



ggagggttca 60 
gtgtcatgca 12C 
tcagcgttgc 18: 
gctacgccaa 24C 
gtcctcgcct 30C 
aaatacctcc 36C 
tggcactgac 420 
cagctgtcag 4 8C 
aaaagctccg 54C 
gcttttcgat 60C 
ggtagaatgg 66C 
ccgtgagagg 72C 
tatgttctca 78C 
cctgcgccaa 84C 
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caagcgcagg 
tccacggcag 
ctcctgtcgc 
t cccag tcgc 
a t accctt ct 
acacacgccc 
aatgagat ta 
ggggaaccag 
cagccgacca 
caggaggcaa 
gagga t a tct 
at tcctcca t 



ccgact gcca 
t acactc 1 1 a 
aaa t taggcg 
cgagcagaga 
1 1 a gcgagaa 
ta t tctcggc 
ct ctgggagt 
gcgagga ta t 
ct ccagcggc 
agt ctgctgg 
1 1 tccct cgc 
ga 



t caaggcaca 
catat t ga a t 
gacccagaac 
cgacaccagc 

CCtCCCtot t 

t tgcactacg 
acact ccgcc 
agccaggaca 
t cgggt tt tg 
t tcccgaggt 
ccgcaaacac 



c tagacaca z 
ct gcgga 1 z z 
agcca ta t ga 
agcagcagcg 
ggtgagctgt 
ttaca tgt tg 
cagggca t tg 
ggggcgacca 
t tea tgttct 
cgaacca teg 
aaacat ggca 



g tt t aegga c 
t gaccccca t 
ccccact gga 
gccacagcag 
tct cctatgt 
gggtacaa t t 
cagcttcca t 
at teegcaag 
tgagtgat ga 
cagcactgcg 
tgctcagaga 



caagaacc tc 
a teggagt tg 
agggagtcga 
tgttgacacc 
tgaccccctg 
getgegtgag 
cagca tgagc 
a tgegaggae 
aggggct t tc 
aegatgetat 
cctcaacaa t 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1512 



<210> 28 
<211> 2161 
<212> DNA 

<213> Aspergillus terreus 
<400> 28 

atgacatccc accacggtga aacagagaag 
aatcatgtca ctggcctcag getaggectg 
ctgatgetet tggatatgtc catcattgtc 
ctccgagcct tggagacaac tgactcttca 
cagtttcatt ccctgggcga tgtcggatgg 
tcgattttcc aacccatgcc ctcttccttt 
cacgctctag ctgtgccctc caacccttgg 
aatacacctt cctcgctttt ctcgggttgt 
ctcgttcgtc aaccatgt-tg attgtagggc 
tcaccaatgg cgcaatcacc attctgtegg 
agtactgata gccagaccta tctcaaccgt 
catagtcttg attgggatca tgatgggccg 
aatcatcaag catagtttct gactccattc 
ccgttgcttg ggggtgcttt cacgcagcac 
tggatttatc ggttcagtgc ttgetttetc 
aagatctttc getcaeggtg attctggtcc 
gtttgecaca tttctccttc tcgtcatcca 
ttcaaccaca gaeggcacaa accccaagag 
ggatttcctt ggattcgtgc tcttcgccgg 
gtggggtggg tctgattatg cgtggaatag 
gggcgtgtcg ctggtgctgt teggatgetg 
gattcccatt tccgtggcca gtcgtcgcca 
tttttccggg gccctactta ttttctccta 
gaatgtttct cccaccatga gtggagtgta 
catggegatt gtgacgggtg caatcagttg 
ttataaccta tggcgttact gacaaattga 
ttccgtgggc gctcgcaagc gggatccttg 
tccagccgga aacctcgatt gcagcatggg 
gaggatgegg aatgeaaace gtaggtgacc 
ettatgeaaa tgctcattga ctcggttgtc 
tgcgctgcct ccacaaacga gccccatcgg 
cggtggctcg ctttttctca ccctgaccga 
tctgcgccaa tatgegecaa ccctcaatgc 
cttccgccaa gtggtccccg ctcctctcat 
cgtggaccat gcattctacg ttgeggtegg 
gggtatgggc cggcttgcct ggagaggctg 
a 



ccacagagca acacggctca aatgeagata 60 
gttgtggttt cagtcactct ggtggcgttt 120 
aeggtcagea tggcaccagc ctggagattg 180 
cattegcagg cgattcctca cattaccgcc 240 
tacggaagtg cgtatcctct atcaaggtga 300 
tctccagccg ggtttctatt gactccacga 360 
caggcaaact atacactctg ttgaccctga 420 
ttgagattgg ateggttett tgcggcactg 480 
gagcagtggc cggaatggga gggtegggge 540 
cggcagctcc aaagcaacag caaccgcgta 600 
tgttatgcta tgctgacccg gatatttaca 660 
tcagttcgcc aacccattgg gatccccgga 720 
ccagtaagcc aaatcgecat tgtatgtgga 780 
gcaagttggc ggtggtgtat gtatccccat 840 
aaaggacctt ggctacgact ccgccacgtc 900 
aggtttttac atcaaccttc ccattggggc 960 
gatccccaac agattgecat ccacgtcgga 1020 
aagaggggct egggaegtet tgacccaact 1080 
ttttgegate atgatatctc ttgctttgga 1140 
ttccgtgatc ateggcttgt tctgtgcggc 1200 
ggaaeggcat gtcggcggtg cagtggccat 1260 
agtctggtgc tcctgcttct tcctcggctt 1320 
ctacctgcct atctacttcc aggeggtcaa 1380 
tatgetgecg ggcattggtg gaeagategt 1440 
agttgccacc attccaccac ctttcttcgc 1500 
gggtggtagt eggtaaaaca ggctattacg 1560 
tgtccatatc cgccggactg gtatcgacct 1620 
tcatgtatca gttcctggga ggcgtgggcc 1680 
tggatcgttt ccatcggttt gcgccgcact 1740 
cctcctttag cctgtcgtcg ccattcaaaa 1800 
catttegcta gccatgttcg gecagacatt 1860 
attggttttt agcaatggtt tggactctgg 1920 
acaggaggta acagccgcag gggccaccgg 1980 
ctctcgggtc ctcttagcat acagtaaagg 2040 
tgcgtctgga gctaccttca tcttcgcctg 2100 
gcggatgcag gaaaaaagac ggagcgaatg 2160 

2161 



<210> 29 
<211> 8035 
<212> DNA 

<213> Aspergillus terreus 
<400> 29 

atgacaccat tagatgegee cggtgcgcct gctcccatag ctatggttgg catgggctgc 60 
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age tttggcc cacqcacaac agatccccac aaactctcaa aattgctgga ggaaggaggg 120 
agcgcctcct ctaagattcc iccttcacga ctcaaigtcg gcggggtcua ccaccccaat 180 
ggccagcgcg ^aggatcggt gagtatgaag gar:c:ggct: tgagcattt: tgaggcccat 240 
atcttcccgc ccagaacgat aggcgttgac tgcgagtaga tgcacgttcg cggtggacac 300 
tttctcgacc aagacccggc tcttttcgat gcctcatctt ccaatatgac tactgaagtt 360 
gccagtgtac gtcccgcgat cgtcgtccag ttgtgtacgg atcagaagcg gaataaaccc 420 
atgctaagac tgccgaatag tgtatggacc cccagtaccc actcatacut gaagtccttr 480 
atgaggcgct cgaagccggt atgtattata ttcct tggt t tcccacgccc g:at:aactc 540 
cccatggctc cgcagcggga a t tcctctcg aacaggtctc cggctccaac actggggctt 600 
ttgcaggaac catgtatcac gactaccaag qczcczzcca gcqccaacca gaagcccttc 660 
cacggtattt cataacagga aatgctggca ccatgctcgc gaatcgcgtc tcccactttt 720 
atgaccttcg tgggcccagt gtctcgatcg acactgcctg ttccacaacc ttaacagcct 780 
tgcatcttgc cattcagagc ttgcgagctg gagaatctga tacggcgatt gtcgctggcg 840 
cgaacctgct actcaatcct gacgtcttta ctaccatqtc caaccttggg tgagtctggt 900 
gttcaatcca tctagtgatc agcattcttg ttgcacagac aatatgtgat gttaactgtg 960 
atgtgctgcg accagcttcc tttcgtccga tgggatttcc tactcatttg actcgagagc 1020 
ggatggctat ggtcgcggag aaggagtggc tgcgatcgtc ttgaagactc tgcccgatgc 1080 
ggtgcgagac ggagacccga tccgcctcat agtgcgcgaa acggcaatca accaagacgg 1140 
ccggacccca gccatcagca cgccgagcgg cgaggcccag gagtgcctga tccaagattg 1200 
ctatcagaag gcccagttgg acccaa'aaca gacttcgtac gttgaggccc atgggacggg 1260 
aaccagagca ggagatccgc tggagcttgc agtcatctcg gccgcgttnc cgggacagca 1320 
gatacaggtg ggctccgtga aagccaatat cgggcataca gaggctgtca gtggtctggc 1380 
gagtttgata aaggtggctc tggctgttga aaagggggtt atcccgccta atgcaaggtt 1440 
cctccagccg agcaagaagt tgctcaagga cactcatatc caggtagcat tatcttcacg 1500 
attttttcct ctcattctat tctttctatt ccagctcctc gctgatttac aaacagattc 1560 
cactgtgtag ccaatcatgg ataccaaccg atggtgtccg ccgcgcatca ataaacaact 1620 
tcggtttcgg aggcgcaaat gctcatgcaa tcgtggagca atatggcccg tttgcagaaa 1680 
catcgatctg cccacctaat ggttattctg gcaactatga tggcaattta ggaacggatc 1740 
aagcgcatat atatgtgctg agtgccaagg atgagaacag ttgcatgaga atggtttcaa 1800 
ggctgtgcga ctatgctacc cacgccagac cagccgacga tttgcaattg ctcgcgaata 1860 
tagcatacac gcttggttct cgtcgctcga acttccgatg gaaggcagta tgtacggcac 1920 
acagcctcac gggtcttgcc cagaatttgg cgggagaagg catgcggcca agcaagtcag 1980 
ccgaccaagt aagactggga tgggtgttca caggccaggg agcgcaatgg tttgcaatgg 2040 
gtcgtgagtt gattgagatg tatcctgtct ttaaagaggc cctgctggaa tgcgatggat 2100 
atatcaagga aatggggtca acctggtcca ttataggtaa agacccgcaa caagtccccg 2160 
gcccaggcta tggaaagcac tcactcatgt caccattgca gaggaactca gtcgccctga 2220 
aacggaaagt cgcgttgatc aggcagaatt cagtctgcca ttgtctacgg ctcttcaaat 2280 
tgcgcttgtt cgtctgctct ggtcgtggaa catccaacca gtagccgtca ctagtcactc 2340 
cagcggagag gcagctgcag cgtacgctat cggggcacta acagcccgct cggccattgg 2400 
aataagctat atacgcggtg cattgacagc aagagaccgc ctggcgtcgg tacataaggg 2460 
gggcatgttg gctgtcggat tgagccgcag tgaagtgggt atatacatca gacaggttcc 2520 
attacagagt gaagaatgct tggtggtggg gtgtgtcaac agcccgtcga gtgtgacggt 2580 
ctcgggagat ttgtccgcca ttgccaagtt ggaggaactg ctccatgctg atcgtatatt 2640 
tgcgagacgg ctgaaagtca cccaagcctt tcactccagc cacatgaact cgatgacaga 2700 
tgctttccga gccggtctta cagaactctt cggagcagac cccagtgatg cagcaaacgc 2760 
cagtaaagat gtgatctacg cttctcccag aaccggggcc cgcctgcacg acatgaatcg 2820 
tcttcgggat cctatacact gggtcgaatg catgcttcac ccggttgagt tcgaatcagc 2880 
attccgtcga atgtgcctgg acgaaaacga ccacatgcca aaggtcgata gggtcattga 2940 
gattggacct cacggagcgc ttggaggccc gatcaagcag atcatgcagc ttccagagct 3000 
tgccacgtgt gacatccctt atctgtcctg tctttctcgt gggaagagct ctctgagcac 3060 
ccttcgcctt ctcgcatcag aacttatccg ggccggattt cctgttgact tgaatgcgat 3120 
caactttccc cgcggatgtg aagcagctcg ggtccaagtg ttgtctgatc taccgcccta 3180 
cccttggaac cacgagacca gatactggaa agagccgcgc atcagccaat ctgcccggca 3240 
gcggaagggc ccagtccacg atctgatcgg attgcaggag ccgttgaacc tgccgttggc 3300 
gcggtcatgg cacaatgtgc ttcgtgtgtc agatttgcca tggctacgcg accacgtcgt 3360 
cggctcgcat attgttttcc ctggggctgg gttcgtgtgt atggcagtga tgggaatcag 3420 
cacgctctgc tcgtccgacc atgaatctga cgacatcagt tacatcctac gcgacgtgaa 3480 
ctttgcgcag gccctgattc tacctgcgga cggggaagaa ggaatagatc tgcgcctcac 3540 
gatttgtgct cccgatcaga gtctgggttc acaggactgg caaagattct tagttcattc 3600 
gatcactgct gacaagaatg actggacgga acactgtacg ggacttgttc gagcagagat 3660 
ggaccagcct ccctccagtt tgtcgaacca acaacggata gacccacggc catggagccg 3720 
taaaacggcg ccgcaggagc tgtgggactc actacatcgg gtgggaattc gtcatgggcc 3780 
cttttttcga aacattacgt gcatcgaaag cqacgggcga gggtcatggt gtacatttgc 3840 
catcgcggac acggcctccg caatgccaca cgcctacgaa tcccagcaca ttgttcaccc 3900 
aaccacacta gactctgcag ttcaggcagc ctataccact cttccattcg ctgggagccg 3960 
gatcaaatct gcgatggtcc ccgctcgcgt cggctgcatg aagatttcct cccgacttgc 4020 
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agatttggag gccagggaca tgctgcgcgc acaaocgaac atgcacagcc aaagtccttc 4080 
cgcattggta acccatgtaq cagt t t t tga ugaggcagat ccggttggag ggcctcttat 4140 
ggagctcgaa gggctggtct ttcagtctcn gggggcaagt ctgggcactt ctgaccggga 4200 
ctccaccgac cccgggaata cttgcagctc ctggcattcg gctccagaca tcagcttagt 4260 
taaccccggc tggcttgaaa aaaccctggg cacaggtatt caggagcacg agatcagcct 4320 
catattggag cttcgacggt gttcggtgca cttcattcaa gaggccatgg aaaatttgag 4380 
cgtaggcgat qtcgagaggc Lgagtggtca tctggccaaa ttctatgcgt ggatgcagaa 4440 
acaactggcg tgtgcccaaa atggcgagct ggggccagag agctccagct ggactcggga 4500 
tagcgagcag gcaagatgca gcctccgctc tagagtggtt gctggtagca ccaacggcga 4560 
aatgatctgt cgcctgggct ccgtgctccc cgctatccta cgtcgggaag ttgatccgtt 4620 
ggaggtgatg atggatggcc acctgttgtc ccgctactat gtcgatgccc tcaagtggag 4680 
tcggtccaac gcgcaagcca gcgagctcgt gcgcctctgc tgccacaaaa acccgcgcgc 4740 
tcgcatactg gaaatcggcg gaggcaccgg gggttgcacc cagctggtcg tggactcctt 4800 
gggcccaaat ccgccggtag gccgctatga ctttactgac gtctcggccg ggttttttga 4860 
agcagcccgc aagcggttcg cgggatggca gaatgtgatg gattttcgga agttggacat 4920 
cgaggacgat ccagaagcgc aggggtttgt gtgcggatcc tacgacgtga tgttggcttg 4980 
tcaggtcctg catgccactt ctaacatgca gcgcacattg actaatgtgc gcaagctgtt 5040 
gaagccagga ggcaaactca ttcttgtcga aaccaccaga gacgagcttg acttgttttt 5100 
cactctcggg ctcctgcccg gctggtggct cagcgaagaa ccagaaagac agtcgactcc 5160 
gtcactaagc cctacgatgt ggcgcagcat gctgcacact actggattca atggtgtgga 5220 
agttgaggct cgtgactgcg atagccacga gttctatatg attagcacca tgatgtccac 5280 
ggccgtacag gcgactccga tgtcatgctc ggtcaaattg cctgaagtgc tcttggtcta 5340 
tgttgactca tctacgccca tgtcttggat atcagatttg cagggagaga ttcgcggcag 5400 
gaattgttcc gtcacttcgc tacaggcact tcgtcaagtt cctcctaccg agggccaaat 5460 
atgcgtattc cttggagagg tggaacactc catgcttggt tcagtcacca acgacgactt 5520 
cacacttttg acctcaatgc tacagctggc tgggggaact ttatgggtca cccaaggagc 5580 
gacaatgaag tctgatgatc ccctgaaggc tctacacctc ggattactac gtaccatgcg 5640 
taatgaaagc catggcaagc gatttgtctc acttgacctc gacccttcgc gtaatccatg 5700 
gacaggcgat tcgcgcgatg ccattgtcag tgttctggat ttaattagca tgtcagatga 5760 
aaaggagttt gactatgcag agcgggatgg agttatccat gttcctcggg catttagtga 5820 
ctccatcaat ggaggcgagg aagacgggta tgccttggag ccattccagg acagccagca 5880 
tctcctgcga ctagatatac agactcctgg gctcctcgat tccctgcact tcacaaagcg 5940 
caatgtggac acatatgaac cagataaatt accggacgac tgggtagaga ttgaaccgag 6000 
ggcgtttggt cttaacttcc gtgacatcat ggtcgcgatg ggtcaattgg aatcaaacgt 6060 
catgggcttc gaatgcgccg gcgtggttac aagtctcagc gagacagcaa gaacaattgc 6120 
acccgggctt gcggtcggag atcgggtttg cgccctcatg aacggacact gggcgtcgag 6180 
ggtgaccaca agccggacca acgtggtgcg cattccagag actcttagtt tcccgcatgc 6240 
tgcctccatc cctctggcct tcacaacagc ttacatttca ctttacaccg ttgcccgcat 6300 
tctgccaggt gaaacggtg.t tgatccatgc cggggcagga ggcgtaggcc aggcggccat 6360 
tattcttgct caattaaccg gtgctgaagt ctttacaact gctggcagtg agaccaagcg 6420 
taaccttttg atcgataaat tccacctcga ccctgatcat gtcttctcga gcagggactc 6480 
cagcttcgtc gacggtatca agacccgcac ccgtggcaag ggggtggacg tggttttgaa 6540 
ctcgctagct gggcctctcc ttcagaagag ctttgactgt ctggctaggt ttggtcggtt 6600 
tgtagaaatc ggcaagaagg atcttgagca gaatagccga ctcgacatgt cgacgttcgt 6660 
ccgcaatgtc tccttctcct ccgttgatat tctctactgg cagcaagcga agcccgctga 6720 
aatcttccag gcgatgtccg aggtcatctt gctgtgggag cgaacggcaa tcggcctgat 6780 
tcatccaata tcagagtatc ctatgtcggc cctggagaag gcctttcgca ctatgcagag 6840 
cggccagcac gttgggaaga ttgttgtgac agtagccccc gatgacgcgg tcctcgttcg 6900 
tcaggaacga atgccactat ttctgaagcc taacgtgtcg tatcttgttg ctgggggcct 6960 
gggtggtatc ggacggcgga tctgcgagtg gctggtcgat cgcggggcgc gatatctcat 7020 
cattctgtct cgaactgctc gcgtggaccc ggtcgtgacg agtctccaag agcggggctg 7080 
caccgtttct gtacaggcgt gtgatgtggc cgatgaaagc cagcttgaag cggctctcca 7140 
acagtgtcgg gcggaggaaa tgcctccgat rcggggcgtc atccaagggg caatggttct 7200 
caaggacgcc ctcgtctcgc aaatgacggc ggacgggttc catgccgccc tgcggcccaa 7260 
ggttcaggga agttggaatc tgcaccgaat tgcatcggac gtggatttct tcgtgatgct 7320 
ctcatccttg gtgggtgtca tgggaggcgc aggacaagcc aactacgcgg ctgccggagc 7380 
gtttcaggac gcgctcgcag agcaccgcat ggctcacaac cagccagcgg tcaccatcga 7440 
cctcggaatg gtccagtcaa ttgggtatgt" agcagagaca gattctgctg tggcggaacg 7500 
actccaacgg atcggctatc aacccttgca cgaagaggag gttctggacg tcctcgagca 7560 
agctatatct cctgtgtgtt cccctgccgc acccacacgg cctgctgtca tcgtcaccgg 7620 
catcaacacc cgcccaggcc ctcactgggc acacgccgac tggatgcaag aggctcgctt 7680 
tgcggggatc aagtatcgtg atccgttgag ggacaatcat ggagctttgt cgctgacccc 7740 
ggcggaagat gacaatcttc acgccaggct gaaccgtgca atcagccaac aggagtcaat 7800 
cgccgtgatc atggaggcga tgagctgcaa gctcatctca atgttcggcc Caacaaataa 7860 
cgaaatgtcc gccactcaga cattggcggg gatcggcgtg gactccctgg tcgccattga 7920 
gctccggaac tggatcacag ctaagttcaa tgttgatatc tcagttttcg agttgatgga 7980 
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gggccgaacg accgccaaaa tcgcggaagt ggtgctgcag agatacaaag cttac SC2E 



<210> 30 
<211> 888 
<212> DNA 

<2I3> Aspergillus terreus 
<400> 30 

atggcgacgc agqaattctt aagcgatgtc tcctccdgat tcttgtctgc tgaagccata 60 
aggtacagag tgaagacggg tgtatccatg gatggatgga tgtatgtgga gatggcacct 12C 
tacatataaa tatacatcat atcaggaaaa ggggatattic gtgtaactct gtgcgaaccg 18G 
acgataaaca tcacttaaga cacctaacca atatagggtt agacacgcct ccgtgtccca 240 
aatcccttcc ggctgcgcac tcggcggtag catcttgtct cacgttcgtt ccgccggacc 300 
catgtgaaaa ttgggaggcg ctgcaggtag cgtgggacaa ggcttgttgc aggaatccaa 36C 
cgccgttgtt ctttatctgc gtttctcttc tgttttcttt ctattccctc tggctgcagc 42C 
gtggcgggtg cgggcgatat ggtgggttgc accgtgtctc taaagtgttt cccaaagtat 480 
ggcccgacga catggattcc cagctaccct caagactaca aaccttagta agtagtaagt 540 
agcttcatag acctatccta attaacctac actaactaaa cccagcacgt ctggggggtt 600 
tcaacccgta atctgcagca gatcctagag cgtaaacccg aacctgcccc aaacaactct 660 
acatacatct caaagggcta tgcaacattc ttcaaccaat tttccttacc acccgtagat 720 
gttacacaga tcctcaatca gacgttgcag caccacgatg ttgagactat taacctggat 780 
tgtggcagtg gcctcttaac cctgcggacc cagctaagga tcttattgat agggaaacct 840 
aagataataa aaccattttc cggtctacgg acgagcatta atgaataa 888 



<210> 31 
<211> 2478 
<212> DNA 

<213> Aspergillus terreus 
<400> 31 

atggagagtg cagagctgtc gtcgaagcgg 
atccgtaagg tccgatgcag caaggagggt 
ctaccctgtg aattctccaa caaagttaca 
ataagatggt ctatattcgg gtcaaatacc 
acgtcgagaa gctcgggagt cgggttggag 
cctttattga tgcccatcag ggctttcgtg 
ggtacacaag ctcaaccagc tcagaagagt 
cttcacccac cgaggaagat ggattctggc 
tcgtcatgga ggcacaggct gctaacgcca 
ccagcggcca agtcgcagag atggtcgcat 
cgaaggtggc tgaggcgaat gaaacgcttc 
cggcatccga aaacgatacc tttctcccgt 
ctatcaacga atatttcaag aagctgcatc 
ttatggacgc agtggaatct cagtacacaa 
ttacctcttt caactgcatt gtgcttcagg 
tcgtgggatg cacaggacaa gacataccaa 
atatcaggca gtgctataat cgattggaaa 
gggccctctt ttgtttggtg cgtcttacaa 
tatactaagg cacttgtggc aatggagtat 
gctcaagtct gcgagttgtc caggctcatt 
acggaagatg gggctgtggg cgaccagcct 
gatgttcgtc aaaaacttgc ttcgttttct 
cacgtatcca tcattggggg caaggcctgc 
ttgcctccat atgactccgc tgcgccacta 
gcattcattc ttgaggagat atatctgggc 
cagagtcgcg tccgccgccg tatccgcaga 
caacatgagc atgtactgcg taccggagat 
gcaacgcagt tgaggtttgc actctcgagc 
agccaggaaa ggggcgctgt ctgcctacaa 
caattgtgcg atgggtgtaa atctggtttc 
accctcttga cagcatccta acctggggat 
ttgatctcat tcatgggaat ctatgtccac 
caggacatgg agatactcac tttcttcgcc 
tcatctgcat ctatctcgta caaattaagc 



caggcatttc ctgcatgtga tgagcgccgg 60 

ccaaagtgct cccattgcct ccgatataac 120 

cgggtcaacc aaacagcaaa attgtaagtg 180 

tgtctttctg actctcagtc agggcgcgcg 240 

atatcgaaca tgccctccaa cgatgcctgt 300 

atctatcaag gccacagtca caagaaagcg 360 

gtgaagtaaa cttgtactca ggcaaacaca 420 

ctctccacgg ttatggctct tttgtttcac 480 

acctaacctc ttggttaccg gtcgatatga 540 

ttgaccgcca agctgtgtca gctgtgcgct 600 

aacagatcat tgaggatatc ccaacactat 660 

ctcttccacc ccgcgctcta gtggagccgt 720 

cacgactccc tatatttagt cgacagacta 760 

tcagaactgg gcctccggac ctggrttgga 840 

ccctgactca aacatcaatt gcgaacaaag 900 

tagattatat gatcataagc ctgctgcgta 960 

ctcttgttaa accccggcta tcgaatatac 1020 

ccctaccttt ggaacgtggc ctgactttat 1080 

tttgatttcg caatttttct gactatcttt 1140 

ggactccatt taacgacaac gaccccgcca 120C 

aaagacttgt tctggagcat cttcctcgtc 1260 

gagcatttcg cttactatag gattnagaag 1320 

ctattgccct cgtatgactg cagcgtacca 1380 

ccaaatgctt ttgcggcacg catacgcttg 1440 

ttatactcag caaaatccag caaaatggaa 1SGC 

attgctcgaa aacttagcca gtggcacgtg 1560 

ccgaataggc ctctcgaaga gtatatctgt 1620 

tgttgggtac ttctgcataa acgcatttgg 1680 

cacgctcggg attgtctgat gctgttcaag 1740 

agcaatttcg acaggtaagt cttccgtgcg 1800 

gtacttcata gcattgtcct gaactattct 1860 

attgtggagg aagaccagcc gatccattca 1920 

atatacacga accgttcggc atccaatagg 1980 

caagtggcca gtcgctgtag cgatattgcc 204C 
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ctcctcctcc 
acgccctcat 
acaactagca 
ggccaggtct 
aczcc tt ctg 
t tccccgact 
actagccaaa 
ctaaatacac 



agaatttaac 
ggaa cgagcc 
ccggctcttc 
gggaca tats 
aggatgcaac 
tttcacttgg 
gcgaa 1 1 egg 
ccatttao 



egagaggegt 
aacctaca tg 
a ta taac 1 1 g 
cttcaacccg 
ccagga t t 1 g 
cat cgacaac 
:cttattacc 



1 1 ta t tccga 
ga ttacga tg 
aa ta t cagee 
agagaaa tac 
c tgagcaa t g 
t tctccgacr 
gaggaggaca 



caacgata tc 
t cgccaa tgc 
cget t ggtgt 
caatgga t gg 
atcctggcca 
t nccact t gg 
taac t cga ta 
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acgaagtcca 2100 
gtccactagc 2160 
acccggagac 2220 
tacaattgeg 2280 
atgccttggt 2340 
cattgacatg 2400 
tgagagacta 2460 
2478 



<210> 32 
<211> 1302 
<212> DNA 

<213> Aspergillus terreus 
<400> 32 

atggagtcta aagtccagac aaatgttcca 
cgtgggaagc gtgtatgtgt tccttcttgg 
ttctcttaat atatgtatag acgaaaggca 
gggcagttga aatctccatc acctaccctt 
ttcegcttta ccgtggccgc caactgacag 
tcgcgcccag ettaagegga gaaaccatga 
ctggtatgct gggtatggag ccaccttggt 
gtagegatec cccttataag cccgcgtgga 
acagaatttg cctccttcaa tatttatege 
tcaactggag cttccgtcct ggctgggttt 
gtaaccccag eggaggegat caagacgaaa 
gatcagttct taattcgttc tcttagcatt 
agtacgactt teggegegat agctggaatc 
tctgcggttg gtcctacaat tttgcggcag 
tataacgaac ttattgggct ggcccgaaaa 
ctggcaagca ecttggtegg ttctgttact 
ctggacgtga teaagacgeg gtaagtgctg 
tatatgetga cttggatgcg acttegggtt 
ctgtacggaa atacctttaa ctgcgtgaaa 
ttctggtccg gcgtctggtt teggacaggg 
cccgtgtaag tttaggagta atctacaggc 
aggtacgaga aagtctacaa gttcttgacg 



<210> 33 
<211> 1529 
<212> DNA 

<213> Aspergillus terreus 
<400> 33 

atgaccaagc aatctgcgga cagcaacgca 
tgggcatcca acctggccac tgacgacatc 
cttattctcg atggtattgc atgtgcctgg 
tatgtgcagg caacaatgag etttgagecg 
caagttagtt ctatccaatc tgaacagtct 
aactggggee tgttgcagca gccatgacca 
acgactacca cagcgaagcc cccctacact 
cagcaagtga ggtcttagee gagcagggca 
ccgccattgt ggggtttgaa tctggcccgc 
tcttgaacaa eggctggcat tgtggagccg 
caggaaagct ccttggtctg actccagact 
cgcaagcctg tggcttaatg teggegcaat 
gattegcage gcgtaatggt cttcttgggg 
tgaagggtgt cctggagaga tettaeggeg 
geagagagee tccctacaaa gaggaggaag 
cctttactat tegcatcaag ctctatgcct 
ctatcgaaaa ccttcagagg aggtaccccg 
ttcgccacgt tcatgtacag ctttcaacag 
aggagagacc catcagttca ategcaggge 
agttggtcga ccagcaatgt cttctggccc 



ttaccaaagg caccccttac ccaaaaagcc 60 
tcgcgcgtgt gccatgttac tgacaatgtc 120 

ucctgcact ggtcgcgggt gcttgtgctg 180 

tcgaatgtga gcttatcctg tgtttaagag 240 

tetattgett ccgctggtag eggctaaaac 300 

tgtggcagct ataaaacctg gaatccgagg 360 

aggaaccaca ttgaaagect ccgttcgtat 420 

gcgaaaagga atgaccgttt gcaataacaa 480 

tcggccctct cgggcccaaa tggagagctc 540 

ggggctggcg tgaccgaggc tgtcttagee 600 

atgtaagttg caacatctca cccgttatcc 660 

gatgeaagga aggttggaaa tgcagagtta 720 

cttcgagatc ggggaccget tggattcttc 780 

tegtccaatg cggcagtgaa gttcactgtt 840 

tactcgaaga aeggegaaga cgtgcaccct 900 

ggagtttgtt gcgcctggtc gacacagcca 960 

ctcagatcga cagtaacccg cccagataag 1020 

accagaatgc aatctcttca ggcaagacaa 1080 

acacttctgc gcaatgaagg cattggggtt 1140 

agactttccc ttacctcggc catcatgttt 1200 

atgatattct tgtacactga cagagegtea 1260 

cagccaaact ga 1302 



aagtcaggag ttaeggcega aatatgecat 60 
cctccggacg tattagaaag agcaaaatac 120 
gttggtgcaa gagtgccttg gtcagagaag 180 
ccaggggcct gcagggtgat tggatatgga 240 
acaaagtata ctgacgatcc tttgtataga 300 
attctgettt catacaggct aeggagcttg 360 
ctgeaagcat tgtcctccct gcggtctttg 420 
aaacaatttc tggtatagct gtcattctag 480 
ggatcggcaa agcaatctac ggateggace 540 
tgtatggtgc tccagccggt gcgctggcca 600 
ccatggaaga tgctctcgga ategegtgea 660 
aeggaggcat ggtcaagege gtgcaacatg 720 
gactgttggc ccatggtggg tacgaggeca 780 
gtttcctcaa aatgttcacc aagggcaatg 840 
tggtggccgg teteggttea ttctggcata 900 
getgeggact tgtccatggt ccagtcgaag 960 
agctcttgaa tagagecaac ctcagcaaca 1020 
cctcgaacag tcactgtgga tggataccag 1080 
agatgagtgt cgcatacatc ctcgccgtcc 1140 
agttttccga gtttgatgac aacttggaga 1200 
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ggccagaagt grgcgatctc gccagcaagg ttactccatc tcatagcgaa gaqtttcatc 1260 



aagacggcaa ctgtcicact 
ttacggaaac tgrcgagaac 
ttcacaaata ccgaaccctc 
aggatcttgt cctcagcct c 



gccggtcgcg tgaggaznga gttcaacgat' gcctcttcrg 1320 

cctcttggag tcaaacagcc caccccaaac gaaccgantc 138 0 

gctggtagcg tgacggacga aacccgggcg aaagagattg 1440 
gacaggctca ccgacaz zac cccaztgctg gagctgctta 



attgtcccgt aaaatcgcca ctqgtataa 



1500 
1529 



<210> 34 
<211> 1704 
<212> DNA 

<213> Aspergillus cerreus 



<400> 34 

a tgggccgcg 

aacgagccag 

gttgttagt t 

cctgaaa teg 

gtct tggt tg 

aggeggacat 

gcggcgataa 

tcgttcatgg 

cgccct tgtc 

geegtaggtt 

gaaagtacta 

tcatcgtcac 

tggggctcgt 

caaeggegtt 

tgetcaagea 

ctctgcatct 

tectggegat 

ttcatttaac 

tga tagggag 

agegeggat t 

tgctgcctgc 

tagtgecca t 

tgaacactta 

gcccacaata 

teattgeagg 

tgcccgtcat 

tct tacctct 

ggat tgatca 

gctttcaacc 



gt gacactga 
agaaaaaggg 
ggatgactct 
cgaa tgaa 1 1 
cca tggga ta 
cat acaa t cr 
acgagaaaat 
tctcaggcca 
tccccaat tg 
tct tcatggc 
gaactcccaa 
t t tcacgagt 
get ttccctg 
taaaccgacc 
gtgggtgtat 
actggaaacg 
tacgeaatat 
gactgcccta 
tctcgtcggc 
ccgtctccct 
gggaacgetc 
aatcgcggcg 
cgtggctggt 
atgttgtctc 
aaagtata tg 
agacgccct c 
caa tgtaca t 
cggcggcca t 
tocctaccca 



gt ccccgaac 
ccgtgat a tt 
ag tegt tact 
tgata tgact 
t tea tccctc 
ggecatt tea 
gt teatageg 
aactgttct t 
cgaaaactaa 
egggactett 
caggaactga 
tggcgtgt ta 
ctat ttttcc 
acacttgttt 
ccaaatgtct 
aaatgetcat 
tcga tcctga 
gtatcgggtc 
ggtaaactt t 
cagga tcgac 
att tacgggt 
t tct tegegg 
ttgttccaca 
tgaccgcaag 
at tcaatact 
ggagt tggat 
gctcacagt t 
cgcacggtgg 
a tag 



ccagcgacga 
cca 1 1 a t gga 
Ctctccagta 
gtcgagacta 
ata tggggtc 
atgett tgtg 
t tcagagtat 
gcagatatct 
tccgttcgtg 
tctggccctg 
ctgt ttgat c 
tct tctggct 
cgaaaaccga 
caa tcata tc 
t tettgeegt 
gccgcaaaca 
ct tcagctcg 
tct tctacct 
cggatcgcac 
tccacagcgg 
ggacacteca 
gctgggggct 
ccctcat taa 
tagaagect t 
cctt ttctgc 
ggacgt tcac 
gttgcaggtg 
ggga taaa ta 



ecteggaagg 
gaaaa tgt g t 
ct tg tct tct 
tcaa tatctc 
ccatgaacaa 
cgtgctccgc 
tgageggett 
ttgagcctgt 
cgcaggt tta 
caataggtac 
aggcccctgc 
tcaactaggt 
aggaacttct 
gaaattctcc 
aagtgcctgg 
aaaggact ta 
tgctata t tc 
cgctccaggt 
cgttcggaga 
gctcatcaca 
agaggataag 
ca tgggcagt 
tttatcccct 
gccacggaac 
agggagtagt 
gcta tgta tg 
tggt tgette 
t gcaaaggtg 



tageggacaa 
car z aegt 1 1 
nccngccgcc 
caatgctggt 
gttagtegge 
tggaaeggea 
aaccggaacc 
acgcatcaca 
ccgtgggacg 
gtaccctgct 
gtgggagggg 
atgageggae 
gagaaggtct 
ecaaeggatg 
gaga tatgee 
tgctgtggcc 
aactcaeggt 
gccgggttcc 
tacatagtaa 
t tgtttgccg 
ggtgggatgg 
tttaactgee 
ttgtgtacat 
eggtctgeag 
gcgctcgttg 
gtacct t tct 
gactatagct 
ggcagaaagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1704 



<210> 35 
<211> 1704 
<212> DNA 

<213> Aspergillus terreus 
<400> 35 

atgggccgcg gtgacactga gtccccgaac 
aacgagccag agaaaaaggg ccgtgatatt 
gttgttagtt ggatgactct agtcgttact 
cctgaaatcg cgaatgaatt tgatatgact 
gtcttggttg ccatgggata ttcatccctc 
aggeggacat catacaatct ggecatttea 
gcggcgataa acgagaaaat gttcatagcg 
tcgttcatgg tctcaggcca aactgttctt 
cgcccttgtc tccccaattg cgaaaactaa 
geegtaggtt tcttcatggc egggactett 
gaaagtacta gaactcccaa caggaactga 
tcatcgtcac tttcacgagt tggcgtgtta 
tggggctcgt gctttccctg ctatttttcc 
caaeggegtt taaaccgacc acacttgttt 
tgetcaagea gtgggtgtat ccaaatgtct 



ccagcgacga ecteggaagg tageggacaa 60 
ccattatgga gaaaatgtgt cattacgttt 120 
ttctccagta cttgtcttct tcctgccgcc 180 
gtcgagacta tcaatatctc caatgctggt 240 
atatggggtc ccatgaacaa gttagtegge 300 
atgctttgtg cgtgctccgc tggaaeggea 360 
ttcagagtat tgageggett aaccggaacc 420 
gcagatatct ttgagcctgt acgcatcaca 480 
tccgttcgtg cgcaggttta ccgtgggacg 540 
tctggccctg caataggtac gtaccctgct 600 
ctgtttgatc aggcccctgc gtgggagggg 660 
tcttctggct tcaactaggt atgageggae 720 
cgaaaatcga aggaacttct gagaaggtct 780 
caatcatatc gaaattctcc ecaaeggatg 840 
ttcttgccgt aagtgcctgg gagatatgee 900 
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ctctgcatct 
tcctggcga t 
ttcatttaac 
tga tagggag 
agcgcgga 1 t 
tgctgcctgc 
t agtgccca t 
tgaacact t a 
gcccacaat a 
tcattgcagg 
tgcccgtcat 
tct tacctct 
gga ttga tea 
act ttcaacc 



actggaaacg 
tacgeaa tat 
gac t gece ta 
tct egtegge 
ccg tct ccct 
gggaacget c 
aa t cgcggcg 
eg tggct ggt 
a tgt tgtctc 
aaagta tatg 
agacgccctc 
caa tgt aca t 
cggcggcca t 
tgect accca 



aaatget cat 
tcga tcctga 
etategggt c 
ggtaaact 1 1 
cagga t ccac 
at t tacgggt 
t tct tegegg 
t tgttccaca 
t gaeegcaag 
a t tcaa tact 
ggagttggat 
gctcacagt t 
cgcacggtgg 
a tag 



gccgcaaaca 
ct tcagct eg 
t cttc tacc t 
eggat cgcac 
t ccacagcgg 
ggacactcca 
gctgggggct 
ccctca ttaa 
tagaagee 1 1 
cc 1 1 1 tct gc 
ggacgttcac 
gt tgcaggtg 
ggga taaa ta 



aaaggact ta 
tget a ta t tc 
cget ccaggt 
cgt teggaga 
gctcatcaca 
agaggataag 
ca tgggcagt 
t t tatcccct 
gccacggaac 
agggagtagt 
gctatgtatg 
tggt tget t c 
tgcaaaggt g 



tgctgtcccc 
aact caccgt 
geegggt tec 
tacatagtaa 
t tgt t tgccg 
ggtgggatgg 
t ttaactgee 
ttgtgtaca t 
eggtctgeag 
gcgctcgttg 
gtacctttct 
gacta tagc t 
ggcagaaagg 



96C 

1020 

108G 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1 680 

1704 



<210> 36 
<211> 1503 
<212> DNA 

<213> Aspergillus terreus 



<400> 36 

a tgaagcccg 

ctgaaeggea 

accaatagtc 

gcccacga tc 

actaacgaga 

ttccttcaga 

gaatcctatt 

atagctggca 

ggcgtctcgc 

gcgctagatg 

aacgacgaag 

ggggaccctg 

aaagggaat t 

aatggcccac 

gttgggactc 

acaaacgagc 

gt taaagcat 

atgggcatgc 

agcat tgt ta 

aacgtggt tc 

tt tgctgact 

tgtaccagac 

cccattgcag 

gaacgcgccg 

tcaactgcgc 

tat 



caa tccttat 

agctcacatt 

caccggacat 

tggccctgga 

ctca tccaac 

ggttgatagc 

ca t cagcact 

gtgacegteg 

cattgttctg 

ttcagaaagt 

cgcccgcct t 

cct tea tct c 

acctc tggee 

tageggacta 

ectgettgaa 

aatcccagtt 

t tatgaggga 

ggggtctagg 

gctggcagga 

aaccatttgt 

gecaggtctg 

caggtcacat 

aatgaaactg 

ccgcgctgt t 

geggcataca 



gaaatactgg 

gagtgagaca 

ctatatcgac 

ct t tgggege 

ategatggee 

ggatcataag 

ggtgttgggt 

tggggeca tc 

gtggacggat 

ccagggtcca 

gcataactgg 

aegtttctae 

ggcgatgtgg 

etaeggagtg 

agcccatgct 

t etaaaeggg 

gggtgtaatt 

egatgetgea 

a teegtgeta 

ccta tttgat 

tgacacagga 

tgatatatcc 

ggcgttctgg 

acaagtggat 

gccacggtgc 



ctcttcgtct 

aaggtgacgg 

cccgatgat t 

gtctttggta 

atcat cgctg 

etcgaegtta 

ccagccaaag 

tatggcttat 

gttaccccaa 

ccgtcagtga 

at tettgeaa 

gcccatgtct 

tcaaatatgt 

gtaatgggca 

gactacgaaa 

acgtgggact 

aggagecaac * 

tcgccgacac 

tcggacatcc 

gt taggatcc 

actgggaact 

tgatgacaa t 

gggcgcagga 

caacacagct 

tcagacagta 



cagctgtgag 

gggccgttca 

eggtctcagt 

aaaa tgecac 

gtaccataga 

ccagcatccg 

gcatacagaa 

acgatatatc 

ccaaacttga 

agtatcgtgg 

a tta tggega 

tcgagctgat 

tt tatgttga 

ctagccacac 

aagaaccgat 

ggat tagcaa 

actgggagac 

ttaaegcaac 

tgaa taaaac 

at tcaccct c 

tacta tgaga 

gcaggcaata 

atttactatc 

caactgatca 

tgggttgcca 



cgcgt caacc 

gctggct tgt 

ggttcgcgca 

agttcgcttc 

taagtcaacc 

tggccagtgg 

tgcgctagtc 

tgaacaaatt 

tgccatctac 

aatttttatc 

ggttgagaac 

cctgcgcctg 

tgacaccaac 

tggta tgacg 

ggctcgagca 

cgaggtcaat 

cgcatacaca 

agtggaagaa 

caacctgtcg 

aaa tatatcg 

geggcatgae 

tgetgegtet 

att ttgacat 

ggacctggga 

atattgggga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1503 



62 



NSDOCJD: <WO 0037629A2 I > 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
29 June 2000 (29.06.2000) 



PCT 



(10) international Publication Number 

WO 00/37629 A3 



(51) International Patent Classification 7 : C12N 15/81, 
9/88, C12P 7/42, C07K 14/38, C12N 9/10, 15/60, 1/15, 
1/19 

(21) International Application Number: PCT/US99/29583 

(22) International Filing Date: 

13 December 1999(13.12.1999) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

09/215,694 18 December 1998 (18.12.1998) US 

(71) Applicant: WISCONSIN ALUMNI RESEARCH 
FOUNDATION [US/US]; 614 Walnut Street, Madison, 
WI 53705 (US). 

(72) Inventors: HUTCHINSON, Richard, C; 4293 
South Deer Run Court, Cross Plains, WI 53528 (US). 
KENNEDY, Jonathan; Apartment 102, 401 North Eau 
Claire Avenue, Madison, WI 53705 (US). PARK, Cheon- 
seok; 11-11 Hwayang-Dong, Kwangjin-ku, Seoul (KR). 



(74) Agent: BAKER, Jean, C; Quarles & Brady LLP, 411 East 
Wisconsin Avenue, Milwaukee, WI 53202-4497 (US). 

(81) Designated States (national): AE, AL, AM, AT, AU, AZ, 
BA, BB, BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, 
DM, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, 
IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, 
LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, 
RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, 
UG, UZ, VN, YU, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent 
(AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent 
(AT, BE, CH, C Y, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— With international search report. 

— Before the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments. 

(88) Date of publication of the international search report: 

21 December 2000 

[Continued on next page ] 



^= (54) Title: METHOD OF PRODUCING ANTTHYPERCHOLESTEROLEM1C AGENTS 



Lovastatin production genes 



< 



ORF1: 

Esterase-like 



□ 



ORF2 

c=t> 



lovA: 
P450I 



lovB; 
NPKS 



lovC: 

Dehydrogenase ORF8* 

I lovD: HMG CoA 
ORF5 I Esterase Reductase 

<t=3 o3><£] <} i 



lovF: 
ScPKS 



ORF13: 
Zn 

ORF12 finger II 



lovE: 
Zn 

finger 



t> 



ORF10: 

Metabolite 

transport 

B2A4//- 



3> 



lovF: 
ScPKS 
I — x 



D4B 



ORF14: ORF15: 
Acetyl CoA Membrane 
transport ORF15 transport 

<tzi f=X> <J=] 



ORF17: 
P450II ORF18 

<£□ □ 
// 



1CM4 
lib 
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terreus D4B segment, wherein the segment is translated and where an increase in lovastatin production occurs. 



BNSDOCID: <WO 0037629A3 I > 



WO 00/37629 A3 MII'll:!ilJ] ilfJI'illlJ 1 I'll ! 1 E 1 1 ' hi ! II 1 1 HI 1 1. 1 II , |[ 111 II .1 1 1 II , l! 



For two-letter codes and other abbreviations, refer to the "Guid- 
a nee Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



1NSDOCID: <WO 0037629A3 I > 



INTERNATIONAL SEARCH REPORT 



Intern lal Application No 

PCT/US 99/29583 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C12N15/81 C12N9/88 C12P7/42 C07K14/38 C12N9/10 
C12N15/60 C12N1/15 C12N1/19 

According to International Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N C07K C12P 

Documentation searched other than minimum documentation to the extent that such documents are included in the lields searched 
Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Internal, WPI Data, PAJ, BIOSIS, MEDLINE, CHEM ABS Data, SCISEARCH, EMBASE, 
STRAND, GENSEQ, EMBL 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



US 5 744 350 A (DAVIS CHARLES RAY ET AL) 
28 April 1998 (1998-04-28) 
cited in the application 
claim 6; examples 18,19,27-29 



EP 0 556 699 A (N0VOPHARM LTD) 
25 August 1993 (1993-08-25) 
claims 1-13; examples 1,2; table 1 



WO 98 48019 A (DIEZ GARCIA BRUNO 
; FERNANDEZ CANON JOSE MANUEL (ES); MINGOT 
ASCENC) 29 October 1998 (1998-10-29) 
examples 1,2 
SEQ ID NOs: 1-4 



-/- 



16,17, 
24-27 

1-U,23, 
28,29 

28,29 

1-11, 
16-27 

23 



j j Further documents are listed in the continuation of box C. 


j)( j Patent family members are listed in annex. 


• Special categories of cited documents : 

"A* document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

*L" document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 


T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X' document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

*Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

*&• document member of the same patent family 


Date of the actual completion of the international search 

4 October 2000 


Date of mailing of the international search report 

T?. 10. no 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

ALC0NADA R0DRIG. . , A 



Form PCT/ISA/210 (second sheet) (July 1992} 

BNSDOCID: <WO 0037629A3 I > 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



0 



Inten nal Application No 

PCT/US 99/29583 



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, ot the relevant passages 



Relevant to claim No. 



MANZONI MATILDE ET AL: "Production and 
purification of statins from Aspergillus 

tGrPGUs strains " 

BIOTECHNOLOGY TECHNIQUES JULY, 1998, 

vol. 12, no. 7, July 1998 (1998-07), pages 

529-532, XP000921032 

ISSN: 0951-208X 

the whole document 

WO 97 00962 A (GRAAF LEENDERT H DE ;BR0ECK 

H C DEN (NL); PEIJ NOEL N M E (NL); VI) 

9 January 1997 (1997-01-09) 

page 16, line 30 -page 17, line 23 

page 18, line 9 -page 24, line 7 

SEQ ID NO. 9 

DATABASE SWISSPR0T 'Online! 

1 January 1998 (1998-01-01) 

OLIVER ET AL.: "Putative tricarboxyl ate 

transport protein C19G12.05 from fission 

yeast . " 

XP002i49143 

Accession 013844 

DATABASE GENEMBL 'Online! 

13 May 1997 (1997-05-13) 

VAN PEIJ ET AL.: "beta-xyl osidase, xlnD 

gene from Aspergillus nidulans" 

XP002149144 

Accession Z84377 

DATABASE SWISSPR0T 'Online! 

1 October 1996 (1996-10-01) 

MURPHY ET AL.: "hypothetical 59.3 KDA 

protein C17C9.16C in chromosome I from 

Schizosaccharomyces pombe" 

XP002149145 

Accession Q10487 

KENNEDY JONATHAN ET AL: "Modulation of 
polyketide synthase activity by accessory 
proteins during lovastatin biosynthesis." 
SCIENCE (WASHINGTON D C) MAY 21, 1999, 
vol. 284, no. 5418, 

21 May 1999 (1999-05-21), pages 1368-1372, 
XP000914559 
ISSN: 0036-8075 
the whole document 



1-11, 
16-29 



12-15 



23 



23 



23 



1-11, 
16-29 



1 



Forni PCT/ISA/210 (continuation ol second sheet) (Joty 1992) 

NSDOCID: <WO 0037629A3 I > 



page 2 of 



2 




INTERNATIONAL SEARCH REPORT 



In. .ational application No. 

PCT/US 99/29583 



Box I Observations where certain claims were found unsearchable'(Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. | | Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



| | Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. | | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 

1 - I y I As all required additionaJ search fees were timely paid by the applicant, this International Search Report covers all 
searchable claims. 

2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
1 ' covers only those claims for which fees were paid, specifically claims Nos.: 



1 1 No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| X j No protest accompanied the payment of additional search fees. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-11, 16-22, 24-29 (complete) and 23 (partially) 

The D4B gene cluster from Aspergillus terreus comprising the 
0RF1, 0RF2, lovA, lovB, 0RF5, LovC, lovD, HMG CoA reductase, 
LovE, 0RF10 and part of the lovFA genes involved in the 
biosynthesis of lovastatin. Uses thereof in a method for 
increasing the production of lovastatin in a 
lovastatin-producing organism, for increasing the production 
of monacolin J in a lovastatin producing organism, and for 
increasing the production of monacolin J in a 
non-1 ovastat in-producing organism; fragments of the D4B gene 
cluster comprising the gene encoding for the esterase-1 ike 
gene (0RF1, SEQ ID N0:20), the gene encoding 0RF2 (SEQ ID 
N0:21), the lovA gene (SEQ ID N0:22), the gene encoding 0RF5 
(SEQ ID N0:23), the lovC gene (SEQ ID N0:24), the lovD gene 
(SEQ ID N0:25), the gene coding for the HMG CoA reductase 
(SEQ ID NO:26), the lovE gene (SEQ ID N0:27), the gene 
encoding ORF10 (SEQ ID N0:28) and the lovB gene (SEQ ID 
N0:29); a lovastatin-producing organism genetically modified 
to increase lovastatin production and a 
non-1 ovastatin-producing organism genetically modified to 
produce monacolin J or to produce lovastatin. 



2. Claims: 12-15 (complete) 

A method of increasing the production of lovastatin in a 
lovastatin producing organism comprising the step of 
transforming an organism with the LovE gene from A. terreus. 



3. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:30) encoding the 0RF12 polypeptide (SEQ ID NOrll). 



4. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
NO:31) encoding the zinc finger polypeptide of SEQ ID NO: 12. 



5. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:32) encoding the acetyl-CoA transport polypeptide of SEQ 



6. Claim : 23 (partially) 
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FURTHER INFORMATION CONTINUED FROM PCT/1SA/ 210 



An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:33) encoding the 0RF15 polypeptide (SEQ ID N0:14). 



7. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:34) encoding the membrane transport polypeptide of SEQ ID 



8. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:35) encoding the P450 polypeptide of SEQ ID NO: 16. 



9. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:36) encoding the 0RF18 polypeptide (SEQ ID NO: 17). 
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A 


16-02-1999 


AU 


6833398 


A 


13-11-1998 






A 
rt 
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c.? 1U I?y<J 


CN 


1226932 


T 


25-08-1999 


CZ 


9804172 


A 


17-03-1999 


EP 


0922766 


A 


16-06-1999 


LT 


98186 


A 


25-08-1999 


LV 


12295 


A 


20-06-1999 


LV 


12295 


B 


20-10-1999 


PL 


330464 


A 


24-05-1999 


SK 


170598 


A 


12-07-1999 


ZA 


9803216 


A 


22-10-1998 


AU 


712559 


B 


11-11-1999 


AU 


6244296 


A 


22-01-1997 


AU 


714511 


B 


06-01-2000 


AU 


6244396 


A 


22-01-1997 


BR 


9608910 


A 


04-05-1999 


CA 


2224624 


A 


09-01-1997 


CA 


2224626 


A 


09-01-1997 


EP 


0833928 


A 


08-04-1998 


EP 


0833922 


A 


08-04-1998 


JP 


11507837 


T 


13-07-1999 


JP 


11507838 


T 


13-07-1999 


WO 


9700964 


A 


09-01-1997 


PL 


324186 


A 


11-05-1998 


PL 


324221 


A 


11-05-1998 
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